TheaterFire

Intel preparing Arc (PRO) "Battlemage" GPU with 24GB memory - VideoCardz.com

Posted by brown2green@reddit | LocalLLaMA | View on Reddit | 218 comments

Reply to Post

218 Comments

SocialDinamo@reddit

24 Gigs on a single slot card... Thank god. Let's get rid of these goofy oversized plastic shrouds so we can get more cards on the mobo. Multi gpu support for basic inference is all I ask. I have a feeling this upcoming year will be a lot of inference time computer and long generations. Excited for this!
View on Reddit #44238534

mycall@reddit

Is there a market for 48GB cards?
View on Reddit #44242618

Paganator@reddit

It would be perfect for home users and small businesses wanting to run AI locally. It's a niche, but a sizeable one.
View on Reddit #44244171

desmotron@reddit

Will continue to grow. Once the “personalAi” craze takes off it will be like the “personal computer” all over again. Wouldn’t be surprised to see Tb vram cards in 4-5 years, def in 10 years. At that point i assume it will be more specialized, possibly not even a video-card anymore. We need the next Bill/Steve/Jeff to put it all together
View on Reddit #44246860

Mochila-Mochila@reddit

APUs will be the norm, I guess. Apple M series and AMD's Strix Halo are showing the way it's meant to become.
View on Reddit #49257442

Dead_Internet_Theory@reddit

10 years ago GPUs were... GTX 980. With 4GB. If the trend continues, in 10 years we'd have 256GB cards, maybe.
View on Reddit #44625796

OkWelcome6293@reddit

I would be surprised to see cards reach TB levels. Making very wide (in terms of memory) graphics cards is expensive, and it’s likely going to be cheaper to use a high-speed network to scale. The industry is already working on this.
View on Reddit #44250411

martinerous@reddit

I often see new promising technologies floating around, memristors, new types of wafers etc., but somehow there hasn't been a mass-production "disruption" still. Hoping for the next year.
View on Reddit #44277933

OkWelcome6293@reddit

There is no disruption right now because the people who make GPUs are happily rolling in cash.
View on Reddit #44278205

Xanjis@reddit

As soon as average user has 16+ GB expect games and applications to start implementing local AI as well. 
View on Reddit #44247805

Dead_Internet_Theory@reddit

This. Imagine if a game has NPC AI you can talk to, and "high settings" for that means more VRAM - games *will* have this, it's a matter of when. Right now, games would have to sacrifice too much in graphics to fit an LLM in a reasonable configuration though.
View on Reddit #44249944

WearMoreHats@reddit

> it's a matter of when Having previously worked in ML for the games industry, this is still pretty far off for mainstream games. But I think we'll start to see it slipping into always-online games where they can run the AI workload in the cloud.
View on Reddit #44250459

Dead_Internet_Theory@reddit

I think you could pull off some level of interactions with 1B-3B models. Like a character understanding the basic of what you said, and just choosing between one of several curated courses of actions. LLM doesn't have to be a chatbot *directly*.
View on Reddit #44252126

EstarriolOfTheEast@reddit

Have you tried to this at volume? Comprehension at the 1B-3B is definitely not at this point yet. Beyond conversation (which I think few games are such that users will want to spend more of their time talking than fighting or exploring), is powering AI actions, from enemy planning to NPC everyday routines (to make the world feel more alive). For this, the smallest borderline acceptable size I've found is 14B unless the game's rules are really simple, with no action requiring reasoning over multiple steps. I'm hoping models released this year in the 3B range get smart enough to power something interesting that a sizeable number of users can run locally.
View on Reddit #44259190

Dead_Internet_Theory@reddit

You definitely don't need 14B. What you need is to rely less on the LLM being a magic prompt-understanding black box and more of a relatively flexible, but focused, decision maker. You can't just show the LLM's output to the user or treat it like lines of dialogue; for that even 14B is far too tiny. But as something like a sentiment classifier, keyword extractor, etc; then small models can do it. Say, a detective game where you have to type what you think happened at the crime scene, but the lines of dialogue are themselves scripted (and thus, much better written than what an AI can make). For constraining LLM outputs you can use things like [GNBF grammar](https://github.com/ggerganov/llama.cpp/blob/master/grammars/README.md).
View on Reddit #44626531

EstarriolOfTheEast@reddit

That limits you to a constrained/small class of games where such simple classifiers can be made use of. But I was speaking more generally, such as controlling the AI for a wizard in a complex magic system. Or enemy AI that leverages the environment for strategies against the player. Stuff like that. Conversation is actually one of the less interesting uses for a game designer.
View on Reddit #44628816

Dead_Internet_Theory@reddit

Think of a game like Event\[0\]. That was seen as groundbreaking and impressive at the time. The dialogue was of course scripted, since LLMs weren't even a thing in 2016; but the magic of that game was that you could just talk to the robot with text. All that work they had to put into a custom NLP solution is now trivial to implement with a tiny LLM. Regarding "AI that leverages the environment for strategies"; honestly even huge LLMs might struggle with this; they have poor spatial reasoning. You're better off using basic algorithms for that (or even a neural network trained for hours on self-play) and just using LLMs for *language*.
View on Reddit #44653261

EstarriolOfTheEast@reddit

Again, that's an example that falls within the same very limited range of games. No spatial reasoning would be needed with the right state/data representation and structures. If the models are not intelligent enough to enhance existing GOAP type AI (it won't be delegating everything to the AI, think of it more like function calling and enhanced graph based reasoning) then they will be useless for most other uses except autocomplete. It's worth aiming higher; models have been getting better with time. With the latest Qwen2.5-14B the least worst I have tested.
View on Reddit #44658532

Dead_Internet_Theory@reddit

I think if you have such a fixed system (I assume you read in some structured text format with game state like JSON maybe, and ask the model to output JSON), finetuning would be the way to go. You can probably finetune a smaller model entirely locally. You could end up with a finetuned Qwen2.5-3B that outperforms non-finetuned Qwen2.5-14B.
View on Reddit #44698563

EstarriolOfTheEast@reddit

Fine tuning is not helpful. Generally, particularly in a game setting, if we can constrain the problem enough to finetune, there'll be much better reliable approaches available.
View on Reddit #44704886

WearMoreHats@reddit

I think we'll see smaller indie games experimenting with this in the near future but it's going to be a good while before AAA's are using it. Game dev timelines are really long now and devs will be wary of adding something like this in at the last minute to a game that's releasing soon, especially when the tech is still changing so rapidly. And they won't want to lose out of potential players for a "nice to have" feature if it significantly increases the game's required specs. Personally, I'd love to see a classic murder-mystery game using LLM powered NPCs. There's a dinner party, someone gets murdered, you have to interview the guests/suspects and explore the house for clues. Each guest has their own backstory, personality, and information about the night in question. The key difference is that you as the player have to come up with the questions base on what you've learned, rather than a case of "I found a knife with the initials S.B, so now when I talk to Steve Banner it gives me a new dialogue option".
View on Reddit #44262809

Dead_Internet_Theory@reddit

\> but it's going to be a good while before AAA's are using it Fine by me, I mostly only buy indies anyway. The AAA industry isn't what it used to be.
View on Reddit #44625149

MagoViejo@reddit

This made me think that the next Fallout game will have it implemented.
View on Reddit #44268441

AntDogFan@reddit

What hardware would be needed to do this now? I am not talking about someone making a mass market game but more someone making a simple game with a local LLM.
View on Reddit #44250222

Dead_Internet_Theory@reddit

I think about the lowest configuration anyone capable of generally purchasing videogames has is 6GB VRAM, 16GB RAM (lower than that I imagine they almost only play f2p or pirate). That's obviously too low, but if you can make something that makes the best use of a 3B model or 7B with offloading, you could make it work, and have higher settings modes. It starts to get good at 16-24GB, where you can run 12B-22B+ models. Personally, I think a game could make use of chain of thought for characters; make them classify your input, polish the response, double check it, have curated responses, things like that (making small models seem smarter).
View on Reddit #44252718

ReasonablePossum_@reddit

Imho it will end up with nvidia and amd just having to release high vram cards to fit their own models in there so videogame devs can just prompt them for their games, instead of bloating each game with their own LLM.
View on Reddit #44250567

Dead_Internet_Theory@reddit

Knowing Nvidia they wouldn't mind if you have to have a subscription to the Omniverse to be able to play your games... $10/month with ads (your Skyrim 25 year edition NPC occasionally recommends a fresh glass of Mountain Dew(tm) and some crispy, crunchy Doritos)
View on Reddit #44251673

durable-racoon@reddit

\> wanting to run AI locally maybe - but most of the software libraries depend on CUDA. Can you even get AI models running without cuda? easily? with good performance? if I have to do a bunch of extra work and use non-standard tools to get the model running that sucks!
View on Reddit #44607548

Paganator@reddit

Sure, but if Intel releases a card that's perfect to run AI locally but lacks software tools, then that encourages open source developers to support Intel cards better, which is what Intel wants. They have to offer something that's not just a card that's worse than nVidia's cards in every way.
View on Reddit #44608540

durable-racoon@reddit

maybe - but AMD has been around forever and hasn't caught up to Cuda/CuDNN
View on Reddit #44614812

Uwwuwuwuwuwuwuwuw@reddit

How about a 92GB card?
View on Reddit #44262585

Adventurous_Train_91@reddit

You can run some pretty powerful models locally with 48GB. I don’t think llama 3.3 70b requires that in some of the smaller quantisation versions. Would also be cool to run the reasoning models locally for longer, rather than being limited to 50 a week for o1 right now with ChatGPT plus
View on Reddit #44254133

Unlucky_Ad4879@reddit

48GB cards already exist and can be bought [Like this one](https://www.newegg.com/p/1FT-000P-004A0?Item=9SIB1PGK8E1530)
View on Reddit #46732857

Ggoddkkiller@reddit

I was saying B700 will have 24GB and it looks like will happen. Next is a B900 card that i'm hoping 40GB VRAM around 800-900 dollars. If it happens it will brake AI customer market entirely..
View on Reddit #44255368

mycall@reddit

I hope they do that with the B900.
View on Reddit #44259650

SteveRD1@reddit

If we see that, NVIDIA is dead to me!
View on Reddit #44335971

SteveRD1@reddit

I'd pay $1000 for an Intel 48GB card. Sure it wouldn't have all the Nvidia goodness, but I could get 2 or 3 at a good price for a decent performing large model at home!
View on Reddit #44335590

candre23@reddit

Intel is already at a place where their compute is the bottleneck. A 24GB card would be struggling to take advantage of models that require that much VRAM at reasonable speeds. 48GB you're talking about 70b models, and battlemage (combined with poorly-optimized software) isn't up to the task.
View on Reddit #44254774

mycall@reddit

How slow is slow though? If I could run a 70B+ model on an Arc PRO, I would invest in that if I found it useful slow be damned.
View on Reddit #44259764

tmvr@reddit

You can't, at least not one one, you would been two. With 24GB you can't fit in a 70B model even with IQ2 and 4K context and you don't want that 2.76BPW anyway. Now with two cards the 72B Llama 3.x would run in Q4\_K\_M and 8-16K context, that would be nice and worth the investment into two 24GB Intel cards if they are cheap enough.
View on Reddit #44325743

candre23@reddit

Nobody knows because the card doesn't exist. But a theoretical 48GB blackmage card is unlikely to be faster than a pair of old P40s in practice running a 70b model.
View on Reddit #44260703

Thellton@reddit

a single card without the complexities of cooling and powering that a pair of old P40s have is still pretty damn good if you ask me. after all, you could then go to 96GB with the same complexity as the P40s.
View on Reddit #44273772

candre23@reddit

The problem being that you're looking at low-single-digit t/s with P40s, and probably the same on a 48GB intel card. Sure, you *can*. But with such a card surely costing well over a grand for the kind of performance you can get from a pair of P40s costing less than half that, why would you?
View on Reddit #44274026

Thellton@reddit

given that we don't know what sort of price Intel would go for for the hypothetical 48gb card, but given that we know Intel are looking to gain market share; if they decide to do $USD1000 per card that'd be pricey but not entirely unreasonable as compared to NVIDIA or AMD. And given Intel cards are already very good for media transcoding, they'd have niche outside AI as well, expanding their utility. Furthermore it'd have warranty and integrated cooling like a brand new A6000 which in Australia goes for 9000+ AUD, so for a patient bugger like me? 48Gb in a single card for a hypothetical price like that would be immensely tempting.
View on Reddit #44279591

psychicsword@reddit

I believe the problem is that the GDDR memory chips don't come in sizes that make it particularly easy to have high density cards without also scaling up all of the other parts of the card.
View on Reddit #44247399

Dead_Internet_Theory@reddit

RTX 5090 is rumored to have 16 x 2 GB GDDR7 modules; I believe Micron and Samsung will make 3GB and 4GB modules, but the JEDEC spec alows for 6GB and 8GB too. Technically, it might be possible some crazy guy makes a frankensteined 64GB 5090 like those Russians and Brazilians that modded previous cards.
View on Reddit #44250281

psychicsword@reddit

I believe the thinking is that we are getting a 512 bit memory bus to make that happen. The 4090 had a 384-bit bus which combined with 2GB chips only allowed for 24GB of vram. So we are actually seeing them scale up the complexity to make that rumored 32GB possible. The increased spec for larger Vram chips is what will really pay dividends for home AI long term because it could allow for additional skus that optimize for high capacity with moderate performance without also needing the complexity of a 512 bit memory bus adding to the cost.
View on Reddit #44255086

tmvr@reddit

You can have a single card with 48GB VRAM using GDDR6 like the Intel card is using, you have that with the pro cards as well. One module is 32bit and the largest size is 2GB. A card with a 384bit bus can have 24GB in a 12 x 2GB at 32bit configuration like the current consumer cards have that we use (3090/Ti, 4090, 7900XTX) or you can have 24 x 2GB at 16bit clamshell mode each (2 RAM chips share the 32bit wide controller in a 2x16bit configuration) like the professional 48GB (or 32GB with 256bit bus) cards do. I also think it would be great to have a relatively cheap B580 pro 24GB under $500 for local inference.
View on Reddit #44325320

swagonflyyyy@reddit

I have one. Its amazing what I've been able to do with it. I'm thinking of getting a second one next year.
View on Reddit #44247871

RobotRobotWhatDoUSee@reddit

What card do you have?
View on Reddit #44299548

swagonflyyyy@reddit

RTX 8000 Quadro
View on Reddit #44300327

RobotRobotWhatDoUSee@reddit

Oh, very nice. What kind if hardware do you have that in?
View on Reddit #44300680

swagonflyyyy@reddit

Asroxk x670 Taichi Ryzen 7950x 128GB RAM 1500 PSU Just upgraded all that, actually. Works amazing. Planning to get a second RTX 8000 to get 96GB VRAM. After that I've done all I can to max out my PC on a reasonable budget.
View on Reddit #44301793

RobotRobotWhatDoUSee@reddit

Not bad to get 96G VRAM. Is your RTX 8000 Quadro passively cooled? (Googling it, there appears to be both passive and actively cooled versions.) I have a dual P40 setup in a refurb R730, which was great to get feet wet, but now I'm bit by the LLM bug and want to expand. Also it turns out the R730 is quite loud, and I haven't found a great way to make it quieter (and no easy place to put it out of the way). Very curious the noise level of your setup as well.
View on Reddit #44302482

swagonflyyyy@reddit

I have 4 axial case fans but it also comes with a blower fan. Usually doesn't get past 85C which is within normal operating temps. However I've had issues previously running models on anything that isn't llama.cpp. I have to be extremely careful to ensure that I don't push the GPU too far because it can overheat extremely fast and the fans would max out, causing the screen to black out. Strangely enough, the PC still works. I can still hear music and whatnot but get no display. I'm not saying the GPU is fragile or anything but you can accidentally overdo it with some models. Like, if you're rapidly generating images, trying to clone a voice with an extremely long text input, not giving it time to rest between workload-heavy outputs, all these things can overheat the GPU pretty quickly.
View on Reddit #44318719

muxxington@reddit

Are the fans controllable? Then maybe just read out all temperatures you can, take the max temperature of all temperatures und use it to controll the fan speed with fancontrol or whatever. That's more or less how I control my fans. [https://gist.github.com/crashr/bab9d0c6aba238a07bae2b999ee4dad3](https://gist.github.com/crashr/bab9d0c6aba238a07bae2b999ee4dad3)
View on Reddit #44304472

Twistpunch@reddit

It will cannibalise Nvidia’s commercial market. I doubt it will happen anytime soon.
View on Reddit #44267607

fullouterjoin@reddit

Yes there is, it is huge. The most popular AI gpu from NVDA the H100 is 80GB. https://www.nvidia.com/en-us/data-center/h100/ DCs have a fixed volume for accelerators, both in power, space, cooling.
View on Reddit #44259705

mycall@reddit

Cool stuff. Now throw that VRAM onto an Arc PRO and call it a day.
View on Reddit #44259929

TheThoccnessMonster@reddit

It’s already a pain with CUDA unless you hand wire it yourself. You’re NOT going to see this usable anytime soon on two Intel cards.
View on Reddit #44239940

Down_The_Rabbithole@reddit

Intel has really optimized their software stack and it's already way better than that of AMD. Not completely up to par yet with CUDA, but honestly it's not that far off either. It's basically a non-factor now.
View on Reddit #44246401

TheThoccnessMonster@reddit

I… trust you don’t spend much time training models. It’s not even close hombre.
View on Reddit #45678994

Down_The_Rabbithole@reddit

I do training runs as part of my dayjob, that said this comment thread was about inference, hence my comment should only refer to inference performance. Which is terrific on Intel GPUs and works out of the box.
View on Reddit #45698378

RebelOnionfn@reddit

I've been running 2 arc a770s for dual GPU inference and it's been pretty seamless
View on Reddit #44243333

Wrong-Historian@reddit

What's the API that you're using? Vulkan? Can you do tensor-parallel?
View on Reddit #44243562

CheatCodesOfLife@reddit

I have an A770+A750 rig. latest llama.cpp compiles/runs fine with dual GPUs. Intel just updated their ollama bult to version 4.6 within the last week too. They have a custom build of ooba. I couldn't get their vllm build running, I think it's unsupported on ubuntu 24.04 (needs 22.04) but I cbf reinstalling as I just use llama.cpp on this rig. It's ok for a cheap llama.cpp/ollama rig but no tensor parallel, prompt ingestion is slow, can't do xtts2, etc.
View on Reddit #44244941

TheThoccnessMonster@reddit

Ok well the discussion we were having was some guy telling me it’s “basically already there” but what it sounds like is that it’s all totally experimental even for inference to say nothing of training.
View on Reddit #45679147

CheatCodesOfLife@reddit

It's solid for llama.cpp inference. llama.cpp seem to be treating syctl as first-class, and a syctl build failure is enough to block an auto-merge. The other things like ollama are officially supported, but you have to use the specific versoins Intel have built. So rather than experimental, I'd say it's slow to get new versions. I managed to get vllm working eventually, but it was a lot of fucking around and stuck on an old version. Training -- I wouldn't get an ARC for it since you're quite limited with what you can use (axolotl build from Intel was old when I looked). Really I'd only recommend it for cheap GPU inference with llama.cpp at this point, and not for dev (you really need cuda if you want to build your own software / hack away at models, etc IMO)
View on Reddit #45682327

RebelOnionfn@reddit

I've been using Intel's custom ollama image using IPEX. It's been a while since I've run benchmarks as I use it pretty casually, but it's faster than what I need for models in the 30b range with a full 32k context.
View on Reddit #44244416

satireplusplus@reddit

And IPEX is using SYCL (https://www.khronos.org/sycl/) with Intel's OneApi, which is also a supported backend in llama.cpp. See here https://github.com/ggerganov/llama.cpp/blob/master/docs/backend/SYCL.md I've been able to use the iGPU of a N100 succesfully with it.
View on Reddit #44246547

mikaelhg@reddit

How easy is it to get it working? Just a Docker container with some `/dev`s as volumes?
View on Reddit #44254149

satireplusplus@reddit

I'm not a fan of docker and anything docker+GPU is going to be a PITA. Probably needs its own post, but I've posted instructions [here](https://www.reddit.com/r/LocalLLaMA/comments/1hfrdos/rumour_24gb_arc_b580/m2e9nd5/) as a comment.
View on Reddit #44255042

fallingdowndizzyvr@reddit

> You’re NOT going to see this usable anytime soon on two Intel cards. That's funny, I've been running 2xA770s for the better part of a year.
View on Reddit #44267180

ziggo0@reddit

>Goofy oversized plastic shrouds Right? I'm still on my eVGA 2070 Super. It's dual slot, 2 fans - no real bullshit and its heavily overclocked. +135mhz on the core and memory is at +850mhz. The card doesn't over heat, get loud at full load - it's really stupid how bulky cards have gotten. I'm racist against RGB but each to their own lol
View on Reddit #44289450

candre23@reddit

This is an imaginary image. It is extremely unlikely that if intel actually does make a 24GB card, that it will be a single-slot affair.
View on Reddit #44239910

camwow13@reddit

Also single slot cards are generally loud unless it's a low TDP chip. Everyone forgot about the 2000s
View on Reddit #44267588

esuil@reddit

Not really. I have RTX A4000 and nothing about them is loud. Most beautiful GPU I ever had. Sad to be getting rid of it right now.
View on Reddit #44279508

fallingdowndizzyvr@reddit

> Multi gpu support for basic inference is all I ask. That has been around for a good while now. I run 2xA770s with AMD and Nividia GPUs too. I also toss in a Mac for spice. How much more multi do you want?
View on Reddit #44267101

Dead_Internet_Theory@reddit

the plastic shroud isn't the bulk of the size of the card, it's the coolers and radiators. Like, you grab a 3090 or 4090 and it's very heavy from all the copper.
View on Reddit #44249740

KL_GPU@reddit

Previous generation a60 was 175$ msrp (12gb vram), please Intel give US 350-400$ card, Just dreaming, but please
View on Reddit #44239858

satireplusplus@reddit

Hope so too, that would seriously shake up the 3090/4090 enthusiast market. If it's $300, then 2x Intel GPUs with 24GB = 48GB VRAM would be less expensive than a used 3090 on ebay.
View on Reddit #44246789

AiAgentHelpDesk@reddit

5090 could be 3.5k and weekend warriors will still buy it. Nvidia knows people will pay whatever the msrp is.
View on Reddit #44250891

Synthetic451@reddit

Dude no. I hate this narrative that gets parroted around. I bought the 3090 at 1600 because it was a good value back then for the performance and vram (needed it for Blender), but the 5090 is already completely out of the running for my next GPU upgrade because of the rumored price. It's ridiculous. I am waiting for 9070xt benchmarks, but the rumors say it will probably be a side-grade for me if I buy it, so I'll probably just wait another year for my GPU upgrade.
View on Reddit #44276637

msaraiva@reddit

https://i.redd.it/d8v03acomlie1.gif
View on Reddit #48330118

epicwisdom@reddit

>> Nvidia knows people will pay whatever the msrp is. > But I won't!! OK, great, thanks for sharing your opinion. Nvidia's still gonna rake in the cash.
View on Reddit #44373688

TheManicProgrammer@reddit

Gotten to the stage that the GPU is now the price of an entire computer...
View on Reddit #44296361

Swashybuckz@reddit

Its still newer tech imo than a basic PC.
View on Reddit #44768269

satireplusplus@reddit

Actually pricing it higher makes for a fairer market - after all people were willing to buy 3090's for scalper prices ($2k+).
View on Reddit #44253667

epicwisdom@reddit

According to what definition of "fairer?" You could just as well argue that scalpers, who can flexibly raise and lower prices, will track the equilibrium price at any given time much more closely than setting an MSRP and only adjusting a few times a year (incl. sales). On the other hand, a price which is many multiples of the manufacturing cost that prices out the best independent researchers in favor of OpenAI/FAANG, regardless of whether the price is set by scalpers or the OEM, is hardly "fair" to those researchers.
View on Reddit #44374026

satireplusplus@reddit

Fairer as in I can buy a 5090 from a store if I can afford it, have proper warranty and it's not sold out in perpetuity. That's what happened to the 3090. Had to wait years to buy one (used) for a fair price, because fuck scalpers.
View on Reddit #44409789

larrytheevilbunnie@reddit

You’re being downvoted, but your right, scalpers only exist if a company prices below what the market will bear
View on Reddit #44268752

nderstand2grow@reddit

how about producing enough so people can buy directly from you instead of scalpers? and then price it low enough for them?
View on Reddit #44388408

larrytheevilbunnie@reddit

Bro, I pray too god that they overproduce GPUs so I can get cheap ones
View on Reddit #44389148

spamzauberer@reddit

Well it’s still supply and demand. So of course scalpers can do their thing when they a) corner the market by buying all the inventory and b) manufactures not giving a fuck about it because that way they can push the price up themselves. NVIDIA has a huge margin.
View on Reddit #44304557

Anjz@reddit

Not going to lie, I'd still buy it. There's no other competitors and local AI Inferencing/Gaming are two big passions of mine. Feels bad because this gfx retail price inflation would drive more people into console gaming probably. But at the same time it's the only card that would allow you to run specific models and be top of the line for gaming as well.
View on Reddit #44275904

Administrative-Air73@reddit

Same and if I can trade or sell my 4090 in the mean time it'll only be a few hundred to upgrade.
View on Reddit #44361184

nixed9@reddit

Forgive my dumb question but can you actually “link” two intel gpus together and utilize the combined ram? is that just handled at the software level?
View on Reddit #44275652

satireplusplus@reddit

At the hardware level it's still two GPUs. At the software level, the simplest way to use them both for inference is to place half the layers of one large model on one card and the other half on the other card. For each token GPU1 computes something, hands the intermediate representation to GPU2, then GPU2 computes the output token. This works well for inference of a single local session because the bottleneck isn't compute, it's memory bandwidth. For training there are tons of ways to make use of both cards and something like PyTorch has abstractions for the most common approaches: https://pytorch.org/tutorials/beginner/dist_overview.html
View on Reddit #44288394

nderstand2grow@reddit

does intel have something like nvLink?
View on Reddit #44388473

satireplusplus@reddit

afaik no
View on Reddit #44409592

nderstand2grow@reddit

https://preview.redd.it/8oueopf6k4ae1.png?width=1322&format=png&auto=webp&s=ee73c1ad03ff6cea478b7a5523d7b93c46f118d4 looks like the bandwidth isn't that good though, it's slower than a Mac Studio with M2 Ultra.
View on Reddit #44388779

martinerous@reddit

As long as it's under 600$, it would be a good option to consider over a used 3090. And, of course, after they get Pytorch and everything else mainstream running smoothly on all platforms (I'm on Windows, using KoboldCpp for LLMs and ComfyUI for images/video/TTS - it all must work at least as well as with CUDA to make it worth considering an Intel GPU).
View on Reddit #44277552

Buttonskill@reddit

Surprised to see this comment so far down. I'm right there with you - Optimistic, but pragmatic. On the one hand, I'm excited for the prospect of a cheap competitor, but there's a lot of work to be done in SW support before an Intel GPU is ready to battle CUDA. Nvidia played the long con, cornered us all with that dependency, and I want out.
View on Reddit #44280800

nderstand2grow@reddit

I really hate nvidia exactly for their monopolistic shenanigans. they make great gpus but f*** their greedy marketing.
View on Reddit #44388527

SwanManThe4th@reddit

Having looked at Intels AI/HPC sdks... They are stacked. They have ipex, OpenVino, MKL, MPI and then all the stuff under OneAPI. Meanwhile AMD can't even get 3 Devs together to implement a stable diffussion on a MiGraphX backend (AMDs equivalent to TensorRT). Had to compile ROCm myself for a decent experience.
View on Reddit #44379210

Secure_Reflection409@reddit

Intel has got a lot of grovelling to do. $300 max
View on Reddit #44264310

NarrowTea3631@reddit

you want fries with that?
View on Reddit #44296510

Unlucky_Ad4879@reddit

I'd just like to say there's a 48GB GDDR6 Vram card out there [Costs a small fortune though](https://www.newegg.com/p/1FT-000P-004A0?Item=9SIB1PGK8E1530)
View on Reddit #46732763

Terminator857@reddit

Intel stop playing tiddlywinks : give us cards with 32 GB of memory, 48 GB of memory, 64 GB of memory.  Speed is much less important than capacity. We don't need pro speed. Consumer speed is fine.
View on Reddit #44251530

qrios@reddit

> Speed is much less important than capacity. We don't need pro speed. Consumer speed is fine. Just use a CPU and lots of system RAM. then.
View on Reddit #44260715

Terminator857@reddit

That wouldn't reach consumer GPU speed.
View on Reddit #44299349

qrios@reddit

You're not even going to manage consumer speed trying to address that much VRAM on a card with that tiny a bus.
View on Reddit #44668590

Terminator857@reddit

Widen the bus then.
View on Reddit #44672804

qrios@reddit

They do. it's how you get $4,000 cards.
View on Reddit #44695453

Terminator857@reddit

We are talking about Intel, not others.
View on Reddit #44702173

qrios@reddit

Intel does not have any secret technology that allows them to cheaply increase bus-width for any cheaper than any of the other companies do.
View on Reddit #44881608

sdmat@reddit

M4 Pro/Max: May I introduce myself?
View on Reddit #44284743

Mickey_Arthur@reddit

This!
View on Reddit #44254354

cloudkiss@reddit

To dear intel. Go big or go home. Make it to be 64gb i wont care about 5090. Even at 4080 speed, it will be faster than mac or sharing layers with cpu.
View on Reddit #44456263

tu9jn@reddit

It has to be cheaper than 2X b580, but that never seem to be the case with pro cards.
View on Reddit #44239293

candre23@reddit

It won't be. It's a "business" card and will have a business price tag.
View on Reddit #44239994

Downtown-Case-1755@reddit

What business is buying an *Arc* card over Nvidia?
View on Reddit #44248294

Enturbulated@reddit

For their pricing strategy, you're asking the wrong question. "What does Intel think/hope they can get sales numbers up to in xx sector in xx months" is more like it.
View on Reddit #44249551

Downtown-Case-1755@reddit

Right, but I ask again... who TF is buying an *Arc* GPU for professional use? They would at least get some sales in a consumer price bracket (including those from business), but I can't picture any business putting up with Intel compatibility issues over just buying Nvidia or even AMD. Maybe there's a niche for video encoding? That seems really small though, and a waste of most of the silicon.
View on Reddit #44262184

SteveRD1@reddit

I mean...not every business has META, GOOGL, AAPL, MSFT money. Some businesses will be like 'ok Joe, here's your budget for the AI model you sold us on in your project plan'. It may not be enough for cards that cost tens of thousands.
View on Reddit #44336331

Downtown-Case-1755@reddit

But then you're just looking at a 4090. Or even a 7900 XTX. The later is probably cheaper, both are faster and far more compatible.
View on Reddit #44336694

SteveRD1@reddit

7900 XTX is a fair competitor, though has many of the same 'not NVIDIA' issues. I don't think the average business could get a 4090. Availability is kaput even for someone less 'discerning' about the quality of the seller.
View on Reddit #44337041

Downtown-Case-1755@reddit

> though has many of the same 'not NVIDIA' issues. rocm is *way* easier to deal with in 'business' frameworks like vllm, or the venerable llama.cpp if you don't want to fuss with setting up pytorch flash attention.
View on Reddit #44338068

candre23@reddit

Exactly. "We could make slightly more money by selling XX of these cards to businesses at $YYYY than we would by selling XXXX cards at $YY to consumers. Therefore, we'll market them to businesses for the higher price and pull down more money for less manufacturing effort".
View on Reddit #44250285

OrangeESP32x99@reddit

Isn’t Nvidia back ordered already? If that’s true, then many people will buy these up until Nvidia catches up to demand.
View on Reddit #44266561

Downtown-Case-1755@reddit

Not at this tier, right? This is way below a W7800, an L4, or whatever, it's a relatively low tier GPU, where the equivalent Quadros aren't so backordered.
View on Reddit #44274783

Cantflyneedhelp@reddit

I would buy it in a heartbeat if it had GPU virtualisation support like their enterprise cards.
View on Reddit #44253070

MoffKalast@reddit

Nah they can totally probably charge a premium for people to be able to stack two of these for 96 GB of VRAM.
View on Reddit #44312453

ForsookComparison@reddit

There's consumer video games where 12gb doesn't cut it now (Indiana Jones) - a lot of normie gamers are being told to buy more VRAM
View on Reddit #44250642

noblex33@reddit

Pros: 24GB, probably much cheaper than NVidia and AMD Cons: poor software support 💀💀💀 So basically the same story as with AMD, right?
View on Reddit #44238099

masterlafontaine@reddit

OpenCL let's goooo
View on Reddit #44238400

djm07231@reddit

I think Intel is going with oneAPI/SYCL these days.
View on Reddit #44243960

Calcidiol@reddit

Yeah but OpenCL and Vulkan are also supported. From that standpoint the intel GPUs have among the most excellent openness and diversity of compute language / framework support options of any others. First class SYCL/oneAPI. Relatively good OpenCL. And Vulkan too.
View on Reddit #44261184

Picard12832@reddit

Sadly I've had a ton of issues with Vulkan compute workloads on my A770 on Linux. Very inconsistent performance, often bad. Very hard to optimize for.
View on Reddit #44270449

Calcidiol@reddit

You're ahead of me wrt. that. I've been meaning to revisit the a770 performance & dev. soon but am just setting up for that so I'll see. Do you have any major tips wrt. optimizing use of A770 / vulkan / sycl / whatever these days? I'm kind of torn with the reported new 24GB ARC in this thread, on the one hand I see from the A770s that they're able to make decent HW for a lower price point than nvidia in midrange consumer GPUs. But the less mature / full support for linux has been frustrating and the slow pace of better SW support in the LLM/AIML/GPGPU ecosystem for better use of any relevant technologies whether sycl, opencl, vulkan, pytorch limitations / just now getting direct xpu support etc. etc. have been frustrating. Yeah I think being relatively newer (vs sycl, opencl IIRC) that vulkan is having some growing pains and also being relatively less used for compute vs. graphics. But given the powerful nature of a free to use open standard that solves the problems of both portable graphics and portable compute I'm sure it'll be getting a lot better on platforms which are open enough & supported enough to be updated / maintained over the next couple years. FWIW I've seen some of what appear to be (from only a glance at the release announcement summary text) potentially relevantly nice changes in both sycl and vulkan related improvements and fixes lately in llama.cpp's release notes from the past few days / week(s). https://github.com/ggerganov/llama.cpp/releases b4382 rpc-server : add support for the SYCL backend (#10934) b4393 vulkan: multi-row k quants (#10846) (etc.) b4396 vulkan: Use push constant offset to handle misaligned descriptors b4397 vulkan: im2col and matmul optimizations...
View on Reddit #44280606

Picard12832@reddit

As far as I know SYCL and Intel's own IPEX code runs best on A770, but I only have the card because I'm developing the llama.cpp Vulkan backend. As far as I know it even beats SYCL in text generation performance in some cases, but prompt processing performance is not good. I haven't found a good way to optimize for A770, it doesn't behave the same (more predictable) way that Nvidia's and AMD's cards do. As an example: I had a lot of trouble getting the XMX matrix accelerators to work. They just slow the card down on regular mesa versions, only on the latest mesa they kinda start working. But for whatever other reason text generation performance dropped significantly with latest mesa. There's always something. I just don't have that as much time to divert to Intel as would be needed.
View on Reddit #44337930

fallingdowndizzyvr@reddit

OpenCL? Dude, even the people in charge of OpenCL are pushing SYCL to replace it.
View on Reddit #44266985

lordlestar@reddit

if this thing is cheap, then people will give it software support
View on Reddit #44242927

SteveRD1@reddit

Exactly.
View on Reddit #44336432

Puzzleheaded_Swim586@reddit

Intel got their gaudi in the mlperf benchmark, AMD didn't even try. Intel is a more serious player than the other company with a DEI CEO. Semi analysis report about amd was brutal. IT body shops in South Asian countries have better engineering leadership than AMD. https://preview.redd.it/p9kseddams9e1.jpeg?width=1080&format=pjpg&auto=webp&s=9b2357e556708ce9167f27c9f0047793894feafd
View on Reddit #44240120

LostHisDog@reddit

Just in case you didn't know, anytime you use the word DEI to degrade a person all we hear is "I am a racist twat" - in case you weren't sure where the down votes were coming from.
View on Reddit #44240883

Puzzleheaded_Swim586@reddit

I literally put a screenshot of the incompetence and if my labelling DEI triggers someone and in turn label me a racist, there is no difference between them and me. We are same🤷‍♂️
View on Reddit #44241362

Environmental-Metal9@reddit

The racism is in equating DEI with incompetence. This CEO could be 100% incompetent on their own merit. No need to bring background into this. Plenty of other CEOs are really bad at their jobs and still have a job, and wouldn’t be a “diversity hire” which is a crazy take to think that our corporate overlords care a single bit for that. 100% of the times it is who can extract more profits from poor saps like us.
View on Reddit #44242857

Caffdy@reddit

Imagine reducing Lisa Su to a simple monicker like "DEI CEO"
View on Reddit #44245770

Environmental-Metal9@reddit

I’d argue that this is a valid take for any one person, not just a CEO. Imagine having a persons entire life and experiences reduced to “DEI hire”. The fact that people fundamentally misunderstand (or willfully misrepresent) the concept of diversity of culture is sort of baffling to me. I am yet to hear one good faith non racist argument against the practice of embracing diversity at the workplace.
View on Reddit #44249171

Bacon44444@reddit

Fuck it. I'll throw one at you. In good faith, honest to God. I'm not trying to look down on anyone or be racist. In fact, I'm not even taking a position. I'm going to simply present to you a good faith, non-racist argument. When hiring, a lot of things need to be factored in to make a great hire. You should really be looking for the quality of the candidate. The best. And if the color of their skin changes that outcome for you, you are the racist. For a fact. If their gender, you're the sexist. I could go on. That's a double-edged sword there. If you choose *not* to move forward with the best candidate based on race, gender, sexuality, etc, that also makes you all those things. It cuts both ways when you judge someone not based on the contents of their character but the color of their skin. The best thing for any organization is to move forward with whoever is most qualified. Who has the best quality. Obviously. And if you want to argue against that, you're not pounding away in anger at me or my words, but the cold, hard facts. If you don't hire the best, your competition just may, and their job is to put you out of business. I suspect that dei is really just an overcorrection. An understandable one. For centuries, people were also chosen for their race and gender. They just went the other way with it. Well, now we've all shifted to the complete polar opposite in response. Fine. I get that. It's just not a good long-term solution. It hurts companies and people, too. You want to be represented? Let your ideas and your contributions do the talking.
View on Reddit #44256032

LostHisDog@reddit

All that is exactly what I would say if I was a super duper racist person who had just zero qualms about continuing to step on the minorities that my people have oppressed through the ages. It takes just a huge amount of willful ignorance of or as is often the case just outright hatred for "others" to not look at the past to see why we are trying to build a different future. There are countless deeds still on record for houses all around the country that stipulate that those houses are never be sold to anyone but whites. All those "historically black colleges" you've heard about were not thus named because that's where the black people decided to attend school. There are millions of African Americans alive today who would have had to literally sit at the back of the bus or go to separate schools or eat at different restaurants and for whom lynching by local authorities was, and lets be honest, still is, a very real possibility. DEI isn't about replacing white people with black people, it's an acknowledgement that we have FUCKING SHAT ON AFRICAN AMERICANS and other minorities for centuries through institutionalized and legislated discrimination policies and practices that have left them disadvantaged against their oppressors. DEI is an opportunity for anyone who is hiring to consider the possibility that the person of color they are interviewing might not meet all the traditional check boxes a hiring manager is looking for but that given the opportunity they have thus far been denied, would flourish in the career they are seeking. Maybe when a few generations of the oppressed can build up familial wealth, attend the same schools as their white peers, drive down the street and not risk being shot for looking suspicious, cash a check without having the police called... you know... exist on equal terms with white people... maybe then DEI might sound quaint... but more likely, we will have just shifted our oppression to someone else. So no... it's not fair to just consider the candidate and let your contributions do the talking when your potential contributions were stolen by an entire race of people who mostly are in charge now and looking down on you for how little you have to bring to the table. IMO of course and from the perspective of a white guy who probably doesn't get it either.
View on Reddit #44289454

Environmental-Metal9@reddit

I actually don't disagree with most of that take. Will you allow me to focus on one specific part of it, not as a way to dismiss the rest, but to further expand the thinking behind why people still feel like DEI initiatives are important? The last part of your take, about wanting to be represented, then one should let one's contributions speak louder? I can for a fact talk about that very personally. I am a an american citizen by birth and by familial right, but grew up in Brazil my entire life. I didn't speak english when I first moved to america, and for the first 2 years of my working career here, I would always be treated differently from my peers due to my heavy accent, in spite of the things that I was saying and contributing to being of similar value as my peers. I'm not speaking hypothetically, I am talking about entire two years of my initial career here having my ideas being passed on because I couldn't represent myself properly. Not for a lack of skills to do so, mind you, but because people dismissed my ideas on the account of my accent. The reason I say it is my accent is that after my accent started reducing, people started listening more. This is just one example of how institutionalized racism happens without people even noticing. I don't think those people realized they were doing that. I very much would have appreciated having a voice and proper representation. If I broke through that ceiling, it very much wasn't just because of my own merit, but a combination of effort, people taking up for me, and societal norms shifting. The idea of changing an organization's way of functioning to be more equitable to everyone is something most non-business owners should be rallying behind, not because it makes the organization better for profits, but because it makes it better for people. We shouldn't want what our corporate overlords want for their businesses (they would totally get rid of you or me if they could), and DEI initiatives, are only one small fractional way of making jobs better for the whole of society. It isn't a system of fairness in the meritocratic way, because so far merit has had nothing to do with real effort being put, but rather how much more money have you made for your boss, and that is a pretty poor metric for merit. In a society that has been historically unfair to a large group of people, the only way to make it equitable is to not be fair in the classical way people think of fairness (one for me, one for you), but rather think about a larger picture in which being fair means that things will be uneven for a while until they can even out. Anyways, that is my take on it. I do appreciate that you took the time to give me an example, and I hope my response doesn't come across as an attack, but rather an expansion of the idea.
View on Reddit #44257380

staatsclaas@reddit

But you’re the one who reduced a hyper-qualified MIT PhD computer scientist to “DEI hire.” Seriously, WTF mate. Then you added a bunch of philosophical posturing for…reasons? The original statement is the problem, and as long as it’s sitting there in the thread no amount of editorializing will make you look any more justified.
View on Reddit #44261662

Environmental-Metal9@reddit

What? Where? I am vehemently criticizing reducing anyone to “DEI hire”
View on Reddit #44263693

staatsclaas@reddit

Haha, sorry I responded to wrong person!
View on Reddit #44264251

Environmental-Metal9@reddit

Phew!!! For a second there I was worried I had grossly misrepresented my point! No biggie!
View on Reddit #44264359

staatsclaas@reddit

Yeah the “something_somethingnumber” usernames run together after awhile.
View on Reddit #44264434

LostHisDog@reddit

I just didn't know if you understood why you were getting the down votes. No one on our side is "triggered", just embarrassed for you is all. Since we all share an interest in AI, I was trying to politely point out why this particular conversation of yours failed here.
View on Reddit #44244079

stylist-trend@reddit

Nobody's triggered. You're just outing yourself as a racist and can't handle it when people call you out for it.
View on Reddit #44243060

staatsclaas@reddit

That’s some heavy copium.
View on Reddit #44242789

MassiveMissclicks@reddit

"During her tenure as CEO of *AMD*, the market capitalization of *AMD* has grown from roughly $3 billion to more than $200 billion." \-Quick Wiki search. Truly incompetent... How dare she?
View on Reddit #44246225

burnqubic@reddit

my theory is that most buyers will be themselves software devs which will result in more oss for the platform.
View on Reddit #44240118

BuildAQuad@reddit

I think this is a crucial part of the puzzle. You need a critical mass of devs with the cards to get support up a running.
View on Reddit #44261556

shing3232@reddit

Nah, it's worse
View on Reddit #44240024

noblex33@reddit

why?
View on Reddit #44240096

shing3232@reddit

software side is even worse because it has no cuda backwards support like hipblas or zulda
View on Reddit #44242382

mapppo@reddit

Looking like this round will fund their driver updates and sometime in 1-4 years i will swap
View on Reddit #44257250

djm07231@reddit

I have heard that their drivers are better and Intel software support has traditionally been much better than AMD. When tiny corp tried developing custom AI frameworks to work with Intel and AMD cards, it was comparatively easy on Intel while they had constant crashes on AMD.
View on Reddit #44244089

Xanjis@reddit

Fine with that. I've got a greymarket Chinese 3060 franken GPU in my system.
View on Reddit #44248095

FullstackSensei@reddit

It's not a competition, but I don't think anybody can beat AMD in poor software support
View on Reddit #44239409

matteogeniaccio@reddit

This is true only in the GPU market. If you talk about generic accelerators, then the worst offender is Xilinx...  Wait! Xilinx is owned by AMD? Now I understand.
View on Reddit #44240364

newdoria88@reddit

It must be really hard to fit vram into a card, huh?
View on Reddit #44267017

Colecoman1982@reddit

As far as I understand it, there is always a hard limit on the max amount of VRAM a given chip architecture can handle that is baked into the design.
View on Reddit #44283543

tmvr@reddit

The limits are determined by the size of the memory chips available (currently 2GB for both GDDR6/X and the upcoming GDDR7 while GDDR7 should get 3GB chips later in 2025) and the memory bus width. The memory chips are 32bit and the GPUs have bus width as multiples of 32, so 64, 96, 128, 160, 192, 256, 320, 384 etc. So with 128bit nus you can have 4GB (4x1GB) or 8GB (4x2GB) VRAM etc. In addition you can run the memory controller and chips in clamshell mode where two chips are connected to the same controller each using 16bits for the total of 32bits and doubling the available capacity. This is how the first batch of 3090Ti was for example because there were no 2GB chips available only 1GB so they had to run 12 on one side and 12 on the other side of the PCB for 24GB total connected to the 384bit bus. The 48GB professional cards from NV or AMD are done the same way, they have the same 384bit bus as the consumer cards and use the same 2GB chips, but they have 2x of them in clamshell mode so you can have az A6000 or A6000 Ada with 48GB VRAM.
View on Reddit #44326605

fullouterjoin@reddit

This is awesome, but damn, they should pull over the 24GB line for a couple reasons. 1. It would literally differentiate their offering, when searching for a GPU, it would give another memory size that is different from NVDA offerings in the consumer space. It could be 26GB or anything, but it should be *more* than 24. 2. If they wanted to charge premium, they could go with anything larger than 32GB, 36, 40, 48, 52. GPU memory is cheap, GPU memory attached to a GPU is expensive. By going with 24, it feels like they are going for the "we have 24 at home" for 1/2 the price of NVDA. Like they want people to do a 1=1 price comparison and/or stumble across it when searching for 24GB GPUs. That number is somewhat arbitrary, there is nothing magic in it. God Intel and AMD are dumb. MBAs have rotten the mind of both organizations.
View on Reddit #44259163

Smile_Clown@reddit

>That number is somewhat arbitrary, there is nothing magic in it. I get frustrated and yet also amused by redditors standing on a soapbox without the slightest clue of what they speak. I just want to point out to you that 26GB isn't really a feasible thing, nor is 30, 31, 33, 38 or whatever other number you come up with. These numbers (capacities) are not arbitrary. Common Memory Capacities (Aligned with Standards): 4, 6, 8, 12, 16, 20, 24GB etc These capacities fit standard memory bus widths (e.g., 192-bit, 256-bit, 384-bit). 4GB: Often seen with a 128-bit bus using 4x 1GB chips, while 24GB: Matches a 384-bit bus using 12x 2GB chips. You can do your own math to see how the in between work out (you won't, but you could). Capacities like 26GB don’t align well with standard memory buses or chip sizes. They would require uneven or non-standard ***chip configurations***, leading to inefficiencies or higher costs. The proper industry alignment ensures memory is efficiently utilized without leaving gaps or underutilized chips. There are other considerations, but this is already too much effort for someone who will ignore reality. >God Intel and AMD are dumb. I know you will not see it, because you still think you are right, even if you read this, but that's really funny. I love it when someone calls something or someone dumb but has no idea what they themselves are talking about. It's delicious irony.
View on Reddit #44270168

fullouterjoin@reddit

Yes they have to deal with what Samsung and Micron sell. I have read the datasheets that don't require an NDA. The same admonishment you think you are delivering to me also applies to you. Instead of arguing about the small stuff, read into the point someone is making and argue against that. AMD and Intel are fighting a war that is 10 years out of date. AMD with bad software and Intel with a SKU explosion and still thinking it can segment its market and match parity on "features". Dumb is a volume, not a point on a single dimension. Some of the dumbest people are I know are geniuses. You are probably a genius.
View on Reddit #44273111

Smile_Clown@reddit

>Instead of arguing about the small stuff, read into the point someone is making and argue against that. Nice try. Accept the L and move on, you know little of what you speak. The literal point you made was to stand out and make a 26gb... You do not get to make a broader point and act righteous ***after*** you have specifically singled something out as a point of contention. Life pro tip: do not do this in real life, it's leaves an impression and not a good one. Example, I maybe an asshole on reddit but IRL, I do not open my mouth unless I know what I am talking about. >Some of the dumbest people are I know are geniuses That I agree with, but it's really universal. Some feel the same about you.
View on Reddit #44317456

Crafty-Struggle7810@reddit

That's fantastic news.
View on Reddit #44313964

Elite_Crew@reddit

OneAPI/SYCL support lets gooooo!
View on Reddit #44244677

MoffKalast@reddit

Yeah if IPEX could be less horrible to work with, that would be great.
View on Reddit #44312534

scottix@reddit

If Intel really wanted to show they are pro, they would drop a 48GB card. To call something PRO that is only 24GB is laughable now.
View on Reddit #44256655

big_cock_lach@reddit

Can you tell that to Nvidia?
View on Reddit #44302352

segmond@reddit

They are not serious. If they want to take on Nvidia, 48gb.
View on Reddit #44253498

Cyber-exe@reddit

Might be coming in the form of a double slot pro card. The fact this current one exist means we probably won't get any 32gb variant of the B770 which would've been a killer deal for anyone wanting a do everything desktop GPU. If they at least make a 48gb pro card that's a good deal they should be able to snatch a slice from nvidias moat either way.
View on Reddit #44275028

segmond@reddit

48gb card. Doesn't need to be pro. It will fuck Nvidia, they will make more profit than they can imagine. Nvidia is too cocky to react to that. Nvidia will stick with their high price until it's too late. They only option they would have is to slash everything by half which they won't do.
View on Reddit #44300471

MatlowAI@reddit

Intel is onto something great here. $250 for the b580 which is pretty much 1/2 the performance of the 3090 in ram capacity, gaming FPS, memory bandwidth... if you get 2 for $500 that's a tempting proposition as is to not be using an old card that may have seen mining duty. If we are talking 2x 24gb cards for 3090 bandwidth and compute or 4x cards for 5090 bandwidth and compute but now all of a sudden we have 48 or 96gb of ram for less than the 3090 or 5090 respectively.... I'm sold on as many as I can afford after selling one of my 4090s... do I wish they had a board with 2x the compute for diffusion? Sure. But its worth it for llm batching and larger models securely... Imagine if they brought back SLI style for gaming with current methodology too...
View on Reddit #44289960

Ok_Warning2146@reddit

Suppose it is simply B580 with doubled bandwidth, this will put its VRAM speed at 912GB/s which is slightly slower than 3090. But B580 only has one tenth the TFLOPS of 3090. So I don't have a high hope that it can be replacement of 3090.
View on Reddit #44289611

ttkciar@reddit

What's the advantage of this product over an old $500 AMD MI60 with 32GB of VRAM?
View on Reddit #44238427

Tmmrn@reddit

It'll be possible to actually buy one in europe presumably.
View on Reddit #44248007

ttkciar@reddit

That's an excellent point. Thanks. And fooey on me for not thinking outside of the USA.
View on Reddit #44286612

FullstackSensei@reddit

Single slot, active cooling out of the box, to name two
View on Reddit #44239490

Someone13574@reddit

We don't know that it is single slot though. The source of the leak doesn't mention it and the photo is a photoshopped A60.
View on Reddit #44250478

rawednylme@reddit

I've been quite happy with my A770, alongside my P40. If Intel get a reasonably priced 24GB card out to market, I'd buy in a heartbeat.
View on Reddit #44286350

Successful_Shake8348@reddit

Pro cards would not get consumer prices... They are like 10x more expensive... So I doubt we will get it next to nothing like the B580 12GB. But I'm ready to be surprised by Intel. ;)
View on Reddit #44246672

Puzzleheaded_Wall798@reddit

this is silly, people would buy 2 B580 for $500 if they tried to charge too much for the 24gb model. Also 10x? someone is going to pay in the vicinity of 5k for this card? what shitty businesses are making these decisions?
View on Reddit #44265933

Lissanro@reddit

Better yet, add another $100 or so and just get used 3090. Getting a pair of 3060 12GB is another alternative... I mean, to compete with Nvidia, the competitor need to offer lower prices, given the customers have to deal with worse software support, more bugs and other issues. If Intel offers 24GB card for $500-$600, let alone higher than that, I would never consider buying one. I am not fan of Nvidia at all, and would be happy to support any competitor if they release worthy product at reasonable price, at least 1.5-2 smaller in price than Nvidia products (this can be at the cost of lesser compute and worse software support, but with bigger VRAM).
View on Reddit #44285723

orrzxz@reddit

Please let it be 500\~CAD. I swear to god, I will put up with whatever CUDA bullshit I have to so I won't have to pay over a fucking grand for a 3090, a 2 gen old card.
View on Reddit #44278632

gfy_expert@reddit

But can consumers actually buy this one? Says not for gamers,for datacenters
View on Reddit #44242270

Cyber-exe@reddit

You can buy Nvidias top tier H100 for 40k on Amazon
View on Reddit #44275083

Healthy-Nebula-3603@reddit

WHY ONLY 24 GB !!!
View on Reddit #44273028

sammcj@reddit

Bring on a single slot 48GB, then we'll be talking!
View on Reddit #44271306

DigThatData@reddit

> Intel `no_one_cares.png`
View on Reddit #44268763

lurkn2001@reddit

Would ollama be able to use it together with 24Gb 3090?
View on Reddit #44267316

SevenShivas@reddit

Very high VRAM for low price is the only way to get me buying intel gpu. They need to wake up and fill the gap for enthusiasts
View on Reddit #44263709

Equivalent-Bet-8771@reddit

MORE!
View on Reddit #44261705

Calcidiol@reddit

Another possible "contra" of this is inspired by looking at what people said about the ARC Pro A60 GPU which I guess is the highest one in that product series for ARC A GPU ICs. People (I just searched) were saying it was vaporware, impossible to find as part of pre-assembled PCs, nearly impossible to find for sale as a DIY upgrade. And people said that kind of stuff for approaching 2 years between late 2022 and early/mid 2024 even though the A7/A5 consumer GPUs were available all that time. So we'll see what the MSRP is for this but also when and where it is even possible to buy it. I bought A770-16s when they launched and would be looking at this as an upgrade / expansion option but it'd have to be actually easily available retail as an upgrade at the right price point to make sense. Otherwise AMD / NVIDIA 24GB cards exist which are also under consideration, as well as cards with more than 32 GBy integrated (which is what I hoped we could see with a battlemage based card). Even just an A770 / B580 with 32GB, 48GB of relatively slower VRAM would be very useful. Even more useful would be just fixing the client desktop PC's upper range to offer 400GB/s RAM BW and let the CPU/IGPU/NPU do the work.
View on Reddit #44261031

Sylv__@reddit

Does OpenAI Triton works with Intel GPUs? Are they using OpenCL / vulkan or have a CUDA/HIP equivalent?
View on Reddit #44254783

stddealer@reddit

If this card can game half decently too, then I will end my friendship with AMD, and Intel will be my new best friend.
View on Reddit #44251933

Dorkits@reddit

Me to Intel now : You have my curiosity, now you have my attention.
View on Reddit #44251786

PositiveEnergyMatter@reddit

Take my money
View on Reddit #44250851

penguished@reddit

Memory... ooh la la...
View on Reddit #44249744

Tmmrn@reddit

If the rumors are true, that's more than AMD's RX 9070 XT 16gb....
View on Reddit #44248527

s101c@reddit

Really really hoping it happens and the price tag is below $500. Will be a very serious alternative to getting a used 3090.
View on Reddit #44248363

sapperwho@reddit

need a CUDA clone.
View on Reddit #44238945

emprahsFury@reddit

If you want a cuda clone you can literally buy AMD right now. What you get with a cuda clone is something that is always several years out of date and performs worse than the original.
View on Reddit #44239962

celsowm@reddit

Opencl?
View on Reddit #44239952