AMD reportedly working on gaming Radeon RX 9000 GPU with 32GB memory

Posted by HLumin@reddit | hardware | View on Reddit | 261 comments

Reply to Post

Reply

261 Comments

[-]

NGGKroze@reddit

If this is true, say bye bye to 599 9070XT 16GB. 599$ for 9070 799$ for 9070XT 999$ for 9070XT 32GB - you are getting 5090 VRAM, but for half the money (MSRP wise) while also being in performance between 4070Ti Super and 4080S. Problem with with such high VRAM, those will be scalped so hard so either you won't find any or you will find them, but lot higher than MSRP.

Reply

[-]

reddit_equals_censor@reddit

>Problem with with such high VRAM, those will be scalped so hard so either you won't find any or you will find them, but lot higher than MSRP. mid size dies with dirt cheap gddr6 and basic packaging. and a not new node. amd can pump those out way beyond scalping possibility. i mean hell they added 2 more months of production already with the delay of the cards. there is also 0 uniqueness about the 32 GB version. same die clam shell and done, so amd could easily shift most of their production to the 32 GB version if they are vastly more in demand.

Reply

[-]

Jeep-Eep@reddit

Yeah, RDNA 4 was designed with manufacturability and cost effect first in mind, in that order. They've acting like the console guys, not trying to catch nVidia this gen.

Reply

[-]

reddit_equals_censor@reddit

>not trying to catch nVidia this gen. it is worth remembering, that amd has a long history of just making midrange cards every other generation even often. so it is nothing new. now arguably with how shit the 50 series is, the cancelled chiplet gddr7 monster high end gpu could be seen as a mistake, but who could have guessed, that the 50 series will be such a piece of shit/stand still. and in regards to catching nvidia, if the leaks, which we got plenty by now are true, then amd is on part or ahead in architecture. amd performance/cost of the whole card should be ahead of nvidia, as they are still using dirt cheap gddr6, instead of gddr7 even. raytracing performance being massively up and fsr4 upscaling being a big step forward. so architecturaly they seem to be at bare minimum on par with nvidia, which is very impressive. and that is the hard part. making a high end version of an architecture just in general is not the hard part. although in rdna4's specific case it would been uniquely harder, because instead of just a massively bigger monolithic die, they went insanely chiplet based with split cores (rdna3 chiplet is child's play in comparison)

Reply

[-]

Jeep-Eep@reddit

Goddamn, when I called nVidia's AI obsession and its repercussions their 'intel node moment'... damn.

Reply

[-]

Mnmemx@reddit

there are a lot of companies you can reasonably accuse of "ai obsession", but probably not the one that is "selling the pickaxes" so to speak who currently has a license to convert all available tsmc allocation directly into money because the industry has an insatiable demand for nvidia GPUs regardless of how gamers view the latest RTX cards

Reply

[-]

Jeep-Eep@reddit

It is when it fucks up their ability to make other sorts of pickaxe.

Reply

[-]

pyr0kid@reddit

they ARE the console guys, they design the parts for xbox/playstation

Reply

[-]

Jeep-Eep@reddit

thatisthejoke.bat Yeah, and RDNA 4 seems to be them starting to more fully apply those learned lessons.

Reply

[-]

Morningst4r@reddit

People said the same thing about RDNA2, yet there was hardly any supply when NVIDIA GPUs were going for 3x MSRP. I hope they do pull it off but I’m not holding my breath.

Reply

[-]

noiserr@reddit

Why would the existence of a double vRAM model affect other models? I'm not following.

Reply

[-]

PalpitationKooky104@reddit

It dont 499, 599, 749. Would work for 9070's

Reply

[-]

NeroClaudius199907@reddit

Amd wouldnt dare. Youtubers & gamers will be very upset

Reply

[-]

moschles@reddit

The world desperately needs someone to migrate Pytorch to run on AMD (sans CUDA)

Reply

[-]

noiserr@reddit

This has been done for awhile now.

Reply

[-]

Strazdas1@reddit

The issue is you need to use ROCm. Most times, you cant use ROCm because theres some fundamental issues AMD refuse to address.

Reply

[-]

noiserr@reddit

Which version of ROCm have you used the last time? I've been doing a lot of pytorch development and it stuff just works.

Reply

[-]

Strazdas1@reddit

I dont remeber the version number. It was a bit over a year ago. We ran into an issue we couldnt solve. There was no documentation about it. Contacting AMD lead to nothing (basically got told to fix it ourselves). Eventually department head decided that it would be cheaper to just switch to Nvidia instead of trying to fix ROCm.

Reply

[-]

moschles@reddit

I'm hearing rumors that ROCm is "experimental" and nobody is seriously maintaining it.

Reply

[-]

noiserr@reddit

ROCm has made a lot of progress in the past year. AMD has literally sold over $5B worth of datacenter GPUs in 2024. This is a serious effort.

Reply

[-]

szczszqweqwe@reddit

Even if it's a 32GB 9070xt it would be a cool thing, especially if it's priced well. They can OC it and call it 9070xtx 32GB.

Reply

[-]

Aquaticle000@reddit

But why a *9070*? I feel like something with that much VRAM would be reserved for like a 9090 or 9090xt, like the 7900xtx with 24GB and the 7900xt with 20GB. This just seems really out of place?

Reply

[-]

MiloIsTheBest@reddit

The 9070 XT that's supposed to be positioned somewhere between the 20gb 7900XT and 24GB XTX? Frankly I'm annoyed it doesn't *already* have 24GB

Reply

[-]

reddit_equals_censor@reddit

>Frankly I'm annoyed it doesn't *already* have 24GB i think price should define if 16 GB is fine or not for now. a 500 euro 9070 xt with 16 GB seems fine this generation. a 600 euro one.. meh. having a double vram edition for the memory difference cost only would make up for that though. the rx 480 4 GB and 8 GB did that and people loved that. choice instead of scams.

Reply

[-]

Merdiso@reddit

Some of you are really delusional, 500 EUR would put it at 449$ MSRP tops, there's no way it will be that low, you're literally setting yourself for disappointments or just looking for excuses to buy nVIDIA.

Reply

[-]

reddit_equals_censor@reddit

500 euros may be 500 us dollars, based on theft and inflation theft at the time. and worth keeping in mind, that 450 us dollars for a 9070 xt would still mean lots of profit for amd... just to keep that in mind. >or just looking for excuses to buy nVIDIA. that is kind of hard for me, as i won't put a fire hazard 12 pin device in my home, nor will i torture myself with nvidia's gnu + linux proprietary drivers, nor will i throw money at missing vram. so wrong person to put that suggestion up :D and disapointment is all but assured by amd's leadership and especially marketing team. <points at last few releases with bad reviews, bad prices followed by price decreases shortly after, so that amd can anjoy bad reviews with acceptable prices at the same time :D

Reply

[-]

Strazdas1@reddit

>based on theft and inflation theft at the time. are you sure theft is the right word? it makes no sense.

Reply

[-]

reddit_equals_censor@reddit

inflation is fully artificially created by physical or digital money printing, that may or may not have any backing. as the nuked value is now in the printed money and as that printed money is used by the kakistocracy and doesn't go the public, that is indeed theft! here is a great lil meme video with lovely music, that explains it in a very nice way ;) [https://www.youtube.com/watch?v=WEMCYBPUR00](https://www.youtube.com/watch?v=WEMCYBPUR00) the graph in the background is the value of the us dollar ;) and important to understand, that inflation is NOT inherent to any money system in society. any such idea is complete nonsense. we had tons of money systems without inflation in the past. to name just one commonly known one, directly value backed currency or direct value in the currency. as in gold backed currency or silver backed currency, or direct silver or gold coins with fixed weight. so yeah the added theft in the form of "tariffs" is far from the only theft from the government. but any more than that would e too far off from the hardware topic. but it makes sense to know when you look at hardware prices over 2 decades with money printers going brrrrrrrrrrrrrrr very hard to devalue YOUR money.

Reply

[-]

Strazdas1@reddit

while money printing encourages inflation, thats not its only source. i understand your dislike for inflation, and we could discuss this somewher else as this isnt the right place if you want, but i dont agree it should be considered theft. And tariffs are only theft if you are one of those "all taxes are theft" people.

Reply

[-]

Strazdas1@reddit

449 msrp tops is what it would need to be if they want to stop loosing market share.

Reply

[-]

Merdiso@reddit

So basically half the price of 5070 Ti in the "real world" ? AMD is not charity, you know.

Reply

[-]

Strazdas1@reddit

Half the features would require half the price. and AMD must bring a very good value proposition if it wants to gain market share.

Reply

[-]

noiserr@reddit

16GB is fine for gamers. Offering a 32GB version for a little extra is perfect imo. People who need 32GB will easily pay an extra $100 or so for it.

Reply

[-]

reddit_equals_censor@reddit

almost as if one shouldn't throw their naming over board to match the competition, that is already full of shit with their naming :D an 8800 XTX 32 G would for example slot in way better naming wise if they wanted to, but nah.... let's not do that :D gotta make sure, that no one has any idea what an amd card's name means even enthusiasts, while the competition, that owns the market has scamed people with names, but kept them consistent for years and years.... uh shit! what if amd already changes names completely before the 32 GB version of the 9070 xt comes out ;)

Reply

[-]

RedPanda888@reddit

Remember Nvidia released the 4060ti with 16GB vram, which now is still equal to the 5080 vram. Helps throw a bone to people who don’t need top top tier gaming but need the vram at a lower cost.

Reply

[-]

Jaznavav@reddit

Because it's their top die this generation, you can't go higher.

Reply

[-]

szczszqweqwe@reddit

Well, as far as we know a bigger die doesn't exist, it will take a lot of time to resurrect it, as they abandoned it long time ago. If they want to release it soon-ish 9070xt and memory on a back of a PCB is probably the only way.

Reply

[-]

SubstantialInside428@reddit

70/80/90 class are separated by their DIE class/design, not the RAM

Reply

[-]

Jeep-Eep@reddit

We did see some power specs that suggest some aggro factory overclocks.

Reply

[-]

FormalIllustrator5@reddit

Spoiler alert - will be using 12V-Nshitia cable..

Reply

[-]

FormalIllustrator5@reddit

To all downvoters... https://videocardz.com/newz/sapphire-radeon-rx-9070-xt-nitro-pictures-reveal-hidden-16-pin-power-connector-design Lol...or you are just Nshitia fans?

Reply

[-]

Reggitor360@reddit

Only Taichi, since AsRock doing AssRock moves agin

Reply

[-]

reddit_equals_censor@reddit

if amd aren't idiots, then they will release a 9070 xt with 32 GB, that has only the memory cost difference added to the 9070 xt price. why? because the profit stays the same. it gives them a great vrm argument, especially if they ONLY release 16 GB cards and up this generation and it makes it look better to the 5080 insult and even the 5090, if you want a card to use for a long while or need more vram for other applications already. that would be the smart move, that would be the "gain market share" move. so amd marketing might fight tooth and nail from letting that happen :D

Reply

[-]

Ascender4ever@reddit

This idea, is exactly spot on. Hope greed doesn't get in the way of patience and sheer logic. If I'm even considering buying the 32gb version after a solid no to the 16gb version due to pcie 4.0 rather than pcie 5.0 That! Is exactly what would do it for me and possibly countless others.

Reply

[-]

Plank_With_A_Nail_In@reddit

So anyone who was in the market for a 9070 16GB is now going to skip it and wait for the 32GB version, well done AMD.

Reply

[-]

Ascender4ever@reddit

I was going to get the 16gb version until rumors came out it was only pcie 4.0, that is what changed my mind. 32gb made me possibly think twice about waiting for pcie 5.0

Reply

[-]

Ascender4ever@reddit

I believe, they would want as many people to be happy as possible. No, it will not cut the amount of devs, yes, it would indeed bring more people to AMD. If the new 9000 series is indeed pcie 4.0 and 32gb vram I am now tempted to get one even though I was waiting for pcie 5.0 simply because of how much Vram is stated as being placed within the card. No, developers will always choose what works best for them either NVIDIA or AMD or Intel no matter what gamers get it would only help the developers bring in more cash as it would open the door for one reason more a game should cost more. They aren't going to care if consumers have a card as powerful as theirs that just means a finer product. Inflation will indeed happen eventually, hopefully so will paychecks, but without some reason to build a game we will never have them as without this new ram capacity there would be no need.

Reply

[-]

Ascender4ever@reddit

P.s. while the developer market is needed.. having a program for seasoned developers to receive a discount or free cards seeing that development is like what? No more than 1-10% of the market share (I could be wrong I'm guessing) but that way through a special program they would get a free card so long as they optimized their games for AMD would be a wise move. Also, if you like the company you buy video cards from solely for the sake of buying from that company, just beware of burning cables (speaking about 12VHPWR). So have fun and beware of all the ups and downs of every company.

Reply

[-]

From-UoM@reddit

Its almost certainly the Radeon Pro Line which has 2x memory with double sided vram Here is the W7900 (70990xtx) with 48 GB [https://www.amd.com/en/products/graphics/workstations/radeon-pro/w7900.html](https://www.amd.com/en/products/graphics/workstations/radeon-pro/w7900.html)

Reply

[-]

mckirkus@reddit

Hard disagree. You can buy a 16GB 7600xt for $300. The cost of doubling the VRAM to 32GB can't be that much more than the cost of the entire card. Maybe I'm not factoring in bus width, or different, more dense chips, but my kids old 3060 has 12GB. For AI, and rendering tasks, just having enough RAM is the most important barrier, how fast it works after that is important but not like that first hurdle. For what I do, CFD can use two GPUs (FluidX3D with OpenCL), and Ollama for AI, BOTH can use more than one GPU and it's additive. So getting two GPUs that total 64GB for around $2k would change everything.

Reply

[-]

From-UoM@reddit

This is not a gaming card. Or else they could have have launched this too with Gaming rdna3 32 GB but didn't The 32 GB based on rdna3 exists btw https://www.amd.com/en/products/graphics/workstations/radeon-pro/w7800.html

Reply

[-]

mckirkus@reddit

Your argument is that because they haven't released it in the past, they will never release one? 24GB consumer cards exist, but 32GB is impossible?

Reply

[-]

From-UoM@reddit

24GB exists because of the 384 bit bus 384 bit bus allowed for the 48 GB pro card. Nvidia does the same. 24 GB of the 4090 and the L40/RTX 6000 Ada has 48 The 5090 will get a pro card 64 GB and possiblly 96 GB variant. The 5080 will get turned into the 32 GB pro card. Pro cards will get the full future set including fully unlocked fp16 perf (on atleast Nvidia cards) with additional software and enterprise support.

Reply

[-]

HilLiedTroopsDied@reddit

valid points but there is such a thing as 3gb chips instead of 2.

Reply

[-]

ResponsibleJudge3172@reddit

And why would you assume they will do what they have never done, using knowledge that they have always had pro cards with double sided VRAM? Why would they cannibalise their pro lineup?

Reply

[-]

PalpitationKooky104@reddit

To support a 5b and growing ai business, and at the same time take gaming market share

Reply

[-]

mckirkus@reddit

Because it would also cannibalize Nvidia's pro lineup.

Reply

[-]

Vb_33@reddit

It is a gaming card the post has been edited as the leaker confirmed it.

Reply

[-]

Tuna-Fish2@reddit

32GB Navi48 would not have a different bus width or denser chips, but just clamshell ram. The spot price for 16Gbit GDDR6 modules is ~$8, add some premium for the top speed grade and a bit more for a more expensive PCB, the cost add to AMD would certainly be below $80.

Reply

[-]

PalpitationKooky104@reddit

so 499 for 9070, 599 for 9070xt, 749 for 9070xtx 32gb. Who would buy 5070ti or 5080?

Reply

[-]

ResponsibleJudge3172@reddit

Same with 48GB Navi31 workstation and 48GB rtx 4090 based workstation cards

Reply

[-]

Tuna-Fish2@reddit

Yep. I made no claim on the price difference between the card with doubled ram config and base, only clarified the cost difference. AMD always gets much higher margins from Pro-grade stuff.

Reply

[-]

JakeTappersCat@reddit

It says "gaming" explicitly. So, are you saying they're lying?

Reply

[-]

Strazdas1@reddit

its AMD marketing department, what do you think?

Reply

[-]

the_dude_that_faps@reddit

Sadly, this is what I think it is. I doubt they would cannibalize their professional SKUs with a consumer 32 GB GPU.

Reply

[-]

liaminwales@reddit

May be a second Radeon VII, failed pro cards? [https://www.techpowerup.com/gpu-specs/radeon-vii.c3358](https://www.techpowerup.com/gpu-specs/radeon-vii.c3358) Always wanted one, just there failure rates seemed a tad high.

Reply

[-]

the_dude_that_faps@reddit

I would've bought one if they made sense vs the competition. But then again, I had a 1080ti back then. It would've been a sidegrade and the crypto boom made them unubtanium.

Reply

[-]

HilLiedTroopsDied@reddit

how would amd cannibalize a market which hardly even exists for them? hah. Just give us huge RAM, we won't buy their pro line up anyways

Reply

[-]

the_dude_that_faps@reddit

I'm all for it, and would pay extra as long as the extra was reasonable. But it's AMD, if it makes sense for consumers, they probably won't do it. They're not the same company that brought us the first few Ryzen CPUs.

Reply

[-]

PorchettaM@reddit

Considering the state of ROCm, sacrificing some of their Pro margins in the short term to bring more devs into their ecosystem would probably be a smart move. Which means AMD is unlikely to do it.

Reply

[-]

kontis@reddit

It would be a bold and desperately needed move by AMD. It would be incredible if true. This would have the greatest chance of destroying CUDA moat. Tons of GitHub projects would try to support this card, by pushing for agnostic alternatives like TinyGrad. Heck, I'm in the "CUDA is required for my work" camp and I would buy this card in a heartbeat.

Reply

[-]

HotRoderX@reddit

I doubt it, the reason being... simple AMD has a track record of picking the ball up and dropping it with in 1-2 generations. There going to need to prove there reliable first. Otherwise why put in the work and effort. Nividia if nothing else is reliable you know there going to continue down the same path and innovate. Where AMD might have a brain fart decide to go branch off in a completely unrelated direction while completely dropping what they were doing and ignoring it.

Reply

[-]

Extra-Advisor7354@reddit

That’s a delusional take .

Reply

[-]

Banned_Dont_Care@reddit

>That’s a delusional take . Oh, would you let the boy dream!

Reply

[-]

topazsparrow@reddit

> would probably be a smart move. Which means AMD is unlikely to do it. LOL. 💀

Reply

[-]

Apprehensive-Buy3340@reddit

It is their stated objective to unify the software stack so every consumer can also be a dev, almost 20 years after nvidia did the same with CUDA.

Reply

[-]

ProperCollar-@reddit

That's what Vega and VII were. It's not AMD is trying this for the first time.

Reply

[-]

BiomassDenial@reddit

Basically, the 9000 series is a placeholder and the last of RDNA cards and whatever they do next will be on their new UDNA architecture.

Reply

[-]

Jonny_H@reddit

It was unified back when OpenCL and co. were the focus, that just fell apart and not maintained as the focus changed. Also about the same time AMD nearly went bankrupt, if we're looking at reasons *why* that might be :P

Reply

[-]

acc_agg@reddit

>Which means AMD is unlikely to do it. Beat me to it lol.

Reply

[-]

PalpitationKooky104@reddit

so fake story? Or you just bash amd for fun?

Reply

[-]

ExtendedDeadline@reddit

This hits hard :(.

Reply

[-]

lordofthedrones@reddit

I want to dream

Reply

[-]

Gansaru87@reddit

I just bought a 7900xtx, it better fuckin be a workstation gpu.

Reply

[-]

simplyh@reddit

> W7900 $5k?? What is this, an NVIDIA GPU?

Reply

[-]

Typical-Common3955@reddit

That.... Makes as much sense as the 4060 ti 16gb version... *Le sigh*

Reply

[-]

AssholeFramed@reddit

Jokes on you, the 4060 ti 16gb does actually make sense.

Reply

[-]

OafishWither66@reddit

9070XT with 32gb vram?

Reply

[-]

FuturePastNow@reddit

They *could* use 3GB chips to make a 24GB model. It wouldn't have enough memory bandwidth but that's never stopped GPU makers in the past.

Reply

[-]

pmth@reddit

How are you able to figure out that that the memory bandwidth wouldn’t be enough to make use of 24gb? I’m not sure how the calculations work.

Reply

[-]

FuturePastNow@reddit

It depends on the workload I suppose. AI stuff apparently only needs capacity rather than bandwidth so if that is the target market maybe it will be fine. But with GDDR6 the 9070s will have 640GB/s of memory bandwidth, compare that to the 5090's *1792*GB/s with GDDR7 and double the memory bus width. This is only part of the equation for a card's performance, but games won't benefit much from just doubling the RAM on a card like this.

Reply

[-]

Strazdas1@reddit

AI workloads are bandwidth starved almost universally.

Reply

[-]

Plank_With_A_Nail_In@reddit

If the model you want to run doesn't fit in VRAM then bandwidth don't mean shit as you are stuck using the CPU and performance will be awful. More VRAM is always better for AI workloads.

Reply

[-]

Plank_With_A_Nail_In@reddit

He's also assuming gaming is the only use of GPU's.

Reply

[-]

Strazdas1@reddit

They are using GDDR6 for this gen, so no 3 GB chips for AMD.

Reply

[-]

Kionera@reddit

This could be the 5090 killer for AI tasks, but knowing AMD they're probably gonna fuck something up.

Reply

[-]

Tiny-Independent273@reddit

if it is 9070 XT then it's just the extra VRAM so I wouldn't expect it to be as powerful for AI

Reply

[-]

Jaznavav@reddit

Speed doesn't really matter for hobbyist AI tasks, VRAM is much bigger barrier.

Reply

[-]

Strazdas1@reddit

Both are important.

Reply

[-]

Terrh@reddit

You can buy a dirt cheap 16gb card already from amd, MI series or a Vega FE. There are 32gb MI series cards as well.

Reply

[-]

Jaznavav@reddit

Sure. Can it game?

Reply

[-]

Terrh@reddit

Mine still does ok for being as old as it is.

Reply

[-]

bill_cipher1996@reddit

That DeepSeek model just needs a lot of VRAM, performance wise it runs great on even older AMD Hardware.

Reply

[-]

Mean-Professiontruth@reddit

It needs hundreds of vram..

Reply

[-]

F9-0021@reddit

The big models, sure. But there are smaller models that are less generalized or weaker that will fit in consumer amounts of VRAM.

Reply

[-]

ea_man@reddit

[https://www.reddit.com/r/selfhosted/comments/1ic8zil/yes\_you\_can\_run\_deepseekr1\_locally\_on\_your\_device/](https://www.reddit.com/r/selfhosted/comments/1ic8zil/yes_you_can_run_deepseekr1_locally_on_your_device/)

Reply

[-]

callanrocks@reddit

Run it on an old server with a TB of the stuff.

Reply

[-]

Apprehensive-Buy3340@reddit

Yes, you'd need multiple cards whatever you buy, so you'd want the cheapest vram/price ratio card that sums up to R1's total size within whatever PCIe slots you can get (mining cards got to 8 slots I think?)

Reply

[-]

gurugabrielpradipaka@reddit

Yep, with my 6900XT I can just use the model with 20B parameters. 32GB would be fantastic 😻

Reply

[-]

Kionera@reddit

It still competes with the 5090 in being able to run larger models.

Reply

[-]

sheldonrong@reddit

$799 I will buy it for sure for AI!

Reply

[-]

PalpitationKooky104@reddit

Could be 749.

Reply

[-]

NeroClaudius199907@reddit

Like 7900xtx and 4090s?

Reply

[-]

noiserr@reddit

7900xtx is great for LLMs. It's not as fast as the 4090 but it's not that slow either. For half the price it's a much better option actually.

Reply

[-]

Kionera@reddit

Those have 8GB less VRAM, and the 4090 costs way more. The 9070XT should also outperform the 7900XTX in AI tasks based on all the AI upgrades to Radeon driver features for RDNA4.

Reply

[-]

SomniumOv@reddit

> but knowing AMD they're probably gonna fuck something up. Watch it not support ROCm. Or get a mere 2 years of support.

Reply

[-]

iamthewhatt@reddit

It will no doubt "support" ROCm, but to what extent is entirely up to AMD's dev team. If they put even half the effort into the GPU software stack that nVidia does, they would be far more competitive.

Reply

[-]

Strazdas1@reddit

Given how muc heffort into ROCm they put so far, id say thats highly unlikely.

Reply

[-]

FinBenton@reddit

Only for some spesific things, every new open source tool and library you see coming out is build for cuda but if you are just infering a regular LLM then it should work.

Reply

[-]

noiserr@reddit

Been using my 7900xtx for LLMs for over a year. And it has been smooth sailing for me. I would definitely get two of these GPUs if they were priced for consumers as vRAM capacity is a major bottleneck.

Reply

[-]

sandor2@reddit

should work good on linux, if its supported by rocm

Reply

[-]

Kryohi@reddit

The big question is whether it will be called 9070XTXX or 9070 AI XT

Reply

[-]

GenericUser1983@reddit

Radeon AI 9070 XT 32 GB AI Edition.

Reply

[-]

PalpitationKooky104@reddit

or bob? Doesnt mater. Or 9070xai?

Reply

[-]

mokkat@reddit

A 9070xt with 32gb VRAM is a good idea. It caters to the crowd who needs it for work, as well as enthusiasts who want to pay more for the "best" card. They can probably earn more % profit since it doesn't need to be great value like the regular 9070xt. A "gaming" 9070xt with 32gb VRAM is stupid as heck. Not even Nvidia can put 32gb on a card and call it "gaming"

Reply

[-]

Farren246@reddit

Entirely unnecessary for a mid-range card, but there might be some buyers. If you actually need 32GB you'd want a high core count to go along with it... and most people would probably want CUDA to go along with it too.

Reply

[-]

HLumin@reddit (OP)

Nah, prolly a 9090 XTX or something

Reply

[-]

OafishWither66@reddit

they aren't gonna make a flagship card this gen

Reply

[-]

iothomas@reddit

It is alleged that when they realised that the 9070 was closer than they expected to the 5080 they pivoted to a 9080 32gb idea that could be just a few % above the 5080

Reply

[-]

Jaegs@reddit

Could just be one that they want to market to people looking to run LLMs locally too, double the memory is super useful for getting the higher parameter models to run.

Reply

[-]

animeman59@reddit

9070 XTraRAM

Reply

[-]

Krotiuz@reddit

Apparently confirmed by the leaker at the end of the article, so yes

Reply

[-]

hitsujiTMO@reddit

I'm going to take these with a pinch of salt. Why are we hearing that AMD aren't going to compete in the high end market, yet still hundreds of rumours of high end cards?

Reply

[-]

Aquaticle000@reddit

My question is, why a 9070 with that much ram specially? I’d expect that amount of VRAM for something like a 9090 or a 9090xt, which we clearly aren’t getting. At least not anytime soon.

Reply

[-]

Strazdas1@reddit

9070 is the highest tier card AMD will produce this generation.

Reply

[-]

Aquaticle000@reddit

That seems to be the trend so far, I’m aware. It was an example.

Reply

[-]

szczszqweqwe@reddit

AI in general, some productivity tasks and game mods.

Reply

[-]

Aquaticle000@reddit

This is obviously not a gaming GPU, which is why I questioned it. I don’t know why the leak is calling it one, it’s definitely not.

Reply

[-]

kontis@reddit

If someone is making cinematics/product videos in UE5 editor or doing some AI this would be an amazing card. Complex CG scenes (no game-like optimization) in UE5 editor can even crash a 24 GB card. AI models don't even start below VRAM requirement.

Reply

[-]

Aquaticle000@reddit

Now see this makes sense, but the leak claims It’s a gaming GPU, which is why I called the naming into question, it didn’t make sense.

Reply

[-]

Jeep-Eep@reddit

Buy that fucker and coast across most of the refresh and next console gen.

Reply

[-]

the_dude_that_faps@reddit

It's niche, but it gives people another reason to choose this over Nvidia.

Reply

[-]

TheAgentOfTheNine@reddit

LLMs. If you offer a lot of ram, even with a low performance, they'll fly off the shelfs.

Reply

[-]

szczszqweqwe@reddit

Why? They can just clamp another 16GB on a back of a 9070xt's PCB, do a slight OC and call it 9070xtx.

Reply

[-]

hitsujiTMO@reddit

You still need to be able to interface all the chips with the soc. That alone means a much larger soc for the extra pinouts. So they would have a whole separate chip for this one card that costs them a whole lot more and they get fewer chips per wafer. I don't buy it. If they did this it would have to be a more performance chip to make it worthwhile. It would be a 9080 or 9090 not a 9070.

Reply

[-]

Strazdas1@reddit

if you do clampshell you retain same bus width and same bandwidth, just with larger storage and significantly higher costs.

Reply

[-]

szczszqweqwe@reddit

Not when they put half of memory on a back of PCB. 4060ti and 4060ti 16GB have a die with exactly the same area: 188mm^(2) [https://www.techpowerup.com/gpu-specs/geforce-rtx-4060-ti-8-gb.c3890](https://www.techpowerup.com/gpu-specs/geforce-rtx-4060-ti-8-gb.c3890) [https://www.techpowerup.com/gpu-specs/geforce-rtx-4060-ti-16-gb.c4155](https://www.techpowerup.com/gpu-specs/geforce-rtx-4060-ti-16-gb.c4155)

Reply

[-]

Hayden247@reddit

The RX 7600 XT vs RX 7600 did the exact same thing from AMD too. Tho the XT was an OC as well so that's how they slapped the XT on. It'd make sense for AMD to clamshell the 9070 XT and make it 32GB. I'd love the 9070 XTX name for funises but apparently just 32GB, fine it is accurate. Some people wanted more than 16GB so it'll find a market even if 700USD. I mean it would be 32GB on the "cheap" since the RTX 5090 is the other 32GB GPU for well over double the price. Near triple probably.

Reply

[-]

szczszqweqwe@reddit

Yeah, I might be one of a customers.

Reply

[-]

Vushivushi@reddit

No it's called clamshell mode where you have two 32-bit memory chips sharing the same 32-bit memory controller. You don't need a new chip with a larger memory bus. You get more capacity at the same bandwidth with this configuration.

Reply

[-]

Numerlor@reddit

clamshell doesn't need additional IO

Reply

[-]

Jeep-Eep@reddit

And honestly, that thing would go for fuckin' years, so it would probably honestly be worth.

Reply

[-]

tobimai@reddit

Could just be a mid card with a ton of memory. Especialy for LLMs interesting

Reply

[-]

moschles@reddit

Article literally says "Tagged as rumor"

Reply

[-]

Visible_Witness_884@reddit

Is it highend with 32gb and a regular core?

Reply

[-]

hitsujiTMO@reddit

Fair enough, didn't see the update that it's just a 9070 xt. But I still don't believe it. There's no 4gb chips so they'd have to have a large enough soc to be able to pin out double the amount of chips and have its own board design with half the chips on the back of the board giving it a 512bit bus. If they said 24gb variant, yeah. But a 32gb variant could only be a professional card.

Reply

[-]

Visible_Witness_884@reddit

Yeah... doesn't seem like the greatest product for gamers. I guess we'll have to wait till next gen from both nVidia and AMD to upgrade if we are already anywhere north of a 7800XT / 4070 in performance.

Reply

[-]

reddit_equals_censor@reddit

should be a perfectly great product for gamers if amd just ads the vram price difference on top of the 9070 xt 16 GB version. assuming the 9070 xt 16 GB is priced aggressively, a 32 GB version would be great value short and especially longterm. 16 GB of more gddr6, that is in the worst case 60 us dollars more. probably a bunch less for amd. if amd charges 500 us dollars or 600 us dollars for the 9070 xt 16 GB and then charges 660 us dollars or 560 us dollars for the 32 GB version, people would buy them without question. just like they bought the 8 GB version of the polaris 10 cards (rx 480 for example)

Reply

[-]

Visible_Witness_884@reddit

32 gigs is just a bit useless on a semi-midrange card if it comes with a price premium. Like, if they made a 32gig 5070 - it'd just cost more and not perform better.

Reply

[-]

reddit_equals_censor@reddit

just give people the choice and let the market decide. i want my next card to have at least 32 GB vram. they can have the market decide and sell more graphics cards like they did with the 4 and 8 GB polaris 10 cards. >if they made a 32gig 5070 the 5070 with 18 or 24 GB would perform better though, because it has only 12 GB vram and with raytracing you can get into scenarios where 12 GB vram becomes an issue. nvidia even straight up told hardware unboxed, that yeah 12 GB vram won't be enough in some scenarios. nvidia sees the 5070 as the card to NOT get. the compromised card. DELIBERATELY COMPROMISED. so yeah a 24 GB (clam shell) or an 18 GB (1.5 x capacity one sided) OR a 36 GB version would perform better than the 12 GB insult already in some scenarios according to nvidia themselves when talking to reviewers... and if you automatically thought, that SURELY the 5070 has 16 GB and not 12 GB, it shows what a middle finger the 12 GB on that card is yet again. and people who might buy a 9070 xt with 32 GB might use their card or be able to use it for the entire ps6 generation so far into the future. yes lowered other settings, but max texture quality and still working just fine. maybe the ps6 will lead to 24 GB vram used at high quality settings, so bye bye 16 GB vram at high quality settings by then. and cards are used for that long if they still work. people have used (some are still using) polaris 10 cards with 8 GB vram. 8 GB vram is already a major issue, but the 4 GB version said good bye way way way way before the 8 GB vram versions. btw worth keeping in mind, that both nvidia and amd could leave it up to partners to configure the memory and let the market decide. partner putting 16 GB on a 4060 non ti for example would vastly outself the 8 GB insult if the difference in price would be just the 8 GB added memory. 24 GB or 18 GB 5070 would vastly outsell the 12 GB insult we can expect. especially nvidia does NOT want customers to have the choice on vram. it is NOT a hardware limitation at all even at a bs tiny bus. they just want to scam people and massively upsell people and have planned obsolescence. people on 12 GB vram will probably be heavily inclined to upgrade once the ps6 comes out and the first ps6 only games come out. while 18 or 24 GB version people would sit on it for 2 more years possibly as it still runs all games decently fine, if it would exist...

Reply

[-]

Visible_Witness_884@reddit

The increase in VRAM over what is used makes no performance gain. There might be a slight gain from having a higher bus width. But if there's no use in it, it doesn't matter.

Reply

[-]

reddit_equals_censor@reddit

>There might be a slight gain from having a higher bus width. the examples mentioned had no bandwidth increases. 1.5 x capacity or clam shell, or 1.5x capacity memory chips + clam shell all have the same memory bandwidth and bus. and again nvidia straight up told told tech reviewers, that yeah the 12 GB card will already have issues at launch in some games and that it is the compromised version. "why aren't you getting the working 16 GB 5070 ti, you poor pleb" is the nvidia mentality. and a disgusting mentality at that. so 16 GB over 12 GB matters according to nvidia!!! even.

Reply

[-]

Visible_Witness_884@reddit

Well, yes, if you want to play all the games with full RT and at max quality. There are loads of games where it won't matter. But I totally agree that nvidia are putting out cards with too low vram for the performance potential. But that's the thing - having absurd VRAM without the performance potential to utilize it? That's just waste of resources.

Reply

[-]

kontis@reddit

They could easily add $200 premium and it would still sell like hot cakes even if it's much slower than 5080. There is no other product like this on the market. Even for a gamer it's super cool to have so much headroom in VRAM.

Reply

[-]

reddit_equals_censor@reddit

they probably could, but i'd argue they shouldn't. they claim to want to gain marketshare that is a great way, while keeping profits the same as the 16 GB card, which will still be great profits at a 500 us dollar price point for the 9070 xt 16 GB. also would be a way to get a ton of gamers away from nvidia. a bit slower than a 5080 for vastly less money with double the vram. that is a deal to take. that is an exciting product. that is sth, that gets people to switch to amd, especially those, who got burned by an 8 GB 3070 ti already for example. and charging a 200 euro/us dollar premium for 60 us dollar (they pay probs a bunch less) for the added 16 GB vram would feel disgusting and probs get pooped on reviewers as well. people would rightfully compare it to the 4060 ti 16 GB scam pricing, that charged 100 us dollars more just to add 8 GB vram and make it a working card. now 16 GB is for now already a working graphics on the 9070 xt, but charging 200 us dollar premium for at worst 60 us dollars more vram would invite those comparisons. if they go 500 us dollar 16 GB and 560 us dollars 32 GB (or 600 and 660), then people will literally see it as a pure vram choice for people who want more vram and a pro customer move and reviewers should praise it massively. that would also be longterm thinking to gain marketshare on a good generation to do it, instead of trying to get a bit more money out of viewer cards sold...

Reply

[-]

reddit_equals_censor@reddit

wrong. clamshell. memory on both sides with the same memory bandwidth. done in the past. done on the 4060 ti 16 GB insult. so 32 GB with 256 bit bus is no problem at all with gddr6. this is also always worth keeping in mind, when a company is lying to you about "oh that is the only possible vram configruation", which is a double lie, because they decided on the memory bus and THEN on top of that decided to only put memory on one side of the card instead of both (clam shell)

Reply

[-]

Healthy_BrAd6254@reddit

7600 XT runs 16GB on 128 bit This is a 256 bit GPU

Reply

[-]

the_dude_that_faps@reddit

Just clamshell it. GDDR6X should be cheap these days.

Reply

[-]

LuminanceGayming@reddit

could be clamshell like the 4060 TI 16GB

Reply

[-]

ehxy@reddit

to be fair....I was kind of expecting a 5070RTX Ti to have AT LEAST 24GB memory but they kinda shit the bead there

Reply

[-]

gatorbater5@reddit

midrange nvidia with a 'surplus' of memory? you must be new. they always have just enough for the moment.

Reply

[-]

Raining_dicks@reddit

Inb4 24gb 5060 ti

Reply

[-]

RealOxygen@reddit

Nvidia moved the needle for what a high end card is Current gen 80 series is typical 70 series performance for 80ti/90 series pricing

Reply

[-]

Significant_L0w@reddit

they have to compete in gaming if they want to keep in track with AI race, it is actually inter connected if you think about it Besides gaming there is huge demand for high vram cards for local llm

Reply

[-]

verci0222@reddit

What's the point if the performance isn't there lmao

Reply

[-]

iMaexx_Backup@reddit

Not everything is about gaming lol

Reply

[-]

verci0222@reddit

Creatives and those looking for local ai machines are on Nvidia so I still fail to see what's the use case for this

Reply

[-]

iMaexx_Backup@reddit

Nope, not necessarily.

Reply

[-]

-Purrfection-@reddit

What kind of productivity tasks can be done on AMD?

Reply

[-]

shugthedug3@reddit

Just about any. Performance isn't usually the best.

Reply

[-]

Strazdas1@reddit

if you need to have software acceleration fallbacks then yeah, performance will suffer. But hey if you are lucky that you can deploy on AMD and not have to spend triple the unit costs on troubleshooting, go for it.

Reply

[-]

noiserr@reddit

You can run LM Studio or any other inference framework for local AI coding assistants.

Reply

[-]

iMaexx_Backup@reddit

Most of them. It’d be much easier to name the handful of tasks that can’t be done with AMD.

Reply

[-]

Mean-Professiontruth@reddit

Mentally gymnastics apparently

Reply

[-]

PorchettaM@reddit

VRAM capacity is the single most important spec for local AI. If AMD can offer 32 GB at a price point where Nvidia is offering 16, the hobbyists will be tempted to switch, even if it means having to deal with ROCm.

Reply

[-]

reddit_equals_censor@reddit

for the same hobbyists: may i interest you in a 128 GB unified memory strix halo mini pc? ;) i mean sucks, that you can't get it with ecc it seems, but yeah 128 GB with a basic high end up, when a 5090 is just 32 GB and even the 5090 workstation version would be just 96 GB... well :D smart amd marketing team (doesn't exist) would look at strix halo and at a 32 GB 9070 xt and market the shit out of vram/unified memory for gamers and professionals. show games straight up breaking down with some 8 GB mobile dedicated gpu insult and show models or 3d renders eating tons of memory, while the competition breaks or crawls to an almost hault. but that is what purely hypothetical smart amd marketing team would do among other things, so sadly it will never happen :o

Reply

[-]

devinprocess@reddit

Have you seen strix halo prices for just 32gb stuff :) They make apple blush.

Reply

[-]

noiserr@reddit

Strix Halo isn't going to be cheap and 9070xt will be faster.

Reply

[-]

reddit_equals_censor@reddit

memory size would be the thing, that matters most for people who want to run local llms. strix halo from my understanding would be fast enough and running a model, that takes up 128 GB would be vastly more interesting. will be very interesting how much amd will charge for strix halo, because theoretically it doesn't need to be super expensive in a laptop or mini pc. THEORETICALLY. both would be great options at possibly different price points. if strix halo is priced agressively, then it could be a great deal if you buy it in a laptop, because you'd also get a great laptop to carry with you, when you aren't running an llm at home theoretically. but they will probs charge a ton. either way they should just do united market for vram to the moon options for customers with both products. it is easy to market sth, when the competition is releasing broken hardware probably again (8 GB vram at laptop for nvidia we expect)

Reply

[-]

kontis@reddit

Exactly, they are on Nvidia. So how to get the OFF Nvidia? With a much higher value product per $.

Reply

[-]

Jaznavav@reddit

Inferencing on AMD *works*, even if worse than Nvidia. Between a 32 and 16gb GPU for local ai the choice is obvious, you can cope with ROCM on linux, you can't cope with having half the memory.

Reply

[-]

Archer_Key@reddit

it should be more powerful than the 7900xt, and yet it have less vram

Reply

[-]

Krugboi@reddit

They cancelled "9080/9090 xt" models believing that they cant catchup to nvidia, but after seeing miserable %10 uplift from 5080 maybe they decided to make it again?? I hope so

Reply

[-]

Strazdas1@reddit

given that we already have AMD cards on shelves, theres no way they have the time to alter the architecture now.

Reply

[-]

amazingspiderlesbian@reddit

32gb of ram would be a 5090 competitor. And they would have to somehow make something like 70% faster than the 7900xtx to compete with that.

Reply

[-]

Krugboi@reddit

They competed 4080 with xtx 24gb's vram, same vram doesnt mean performance segment especially when 5080 should've had 24gb

Reply

[-]

amazingspiderlesbian@reddit

I mean, i guess they could make a selling point of having slow gpu with tons of vram. For like AI stuff market. But it's not really good for gamers since you're going to get a gpu that would probably cost the same or more than a 5070 ti 16gb with less performance than that with a 32gb 9070xt. And like zero actual use for it outside of machine learning

Reply

[-]

Slyons89@reddit

"Gamers" in general are simple to market to, and believe bigger number better, regardless of whether they need it. It's silly, but just because it's practically useless doesn't mean people won't pay extra for it. We see this all the time with things like $500 motherboards and and 1200W power supplies.

Reply

[-]

noiserr@reddit

You don't need 32GB for gaming true. But for local LLMs this would be quite sweet. For one it should be twice as fast as Nvidia's DIGITs.

Reply

[-]

SubstantialInside428@reddit

Weird, I thought 9070XT was the only planned monolithic die for RDNA4 and that's the real reason 80 class got canned, because MCM is still too tricky/costly to make sense ?

Reply

[-]

JakeTappersCat@reddit

Imagine how depressed 5080 buyers will be when AMD releases a 9070XT 32GB for $699 that nearly matches their card in speed and has double the VRAM. Maybe it would teach people a lesson about buying nvidia at launch when they try to rip everyone off

Reply

[-]

TateEight@reddit

Yeah lol this isn’t gonna happen Shit is still gonna die as soon as your turn up RT buT i hAvE mOAr VrAM

Reply

[-]

TateEight@reddit

Can smear your screen with Vaseline with dookie ass FSR too

Reply

[-]

rTpure@reddit

AMD could make a card that's twice the speed at half the price, and people would wait for nvidia to lower their prices so they could buy a geforce instead

Reply

[-]

Reasonable_Can_5793@reddit

I’m a big fan of this idea. It’s about time they stopped making the 16GB version the high-end model. This would definitely encourage Nvidia to make the 6080 at least 24GB of VRAM, and every mid-range card should have at least 16GB instead of 12GB. And the low-end cards should have at least 12GB.

Reply

[-]

Optimal_Visual3291@reddit

Gamers don’t need 32gb. This would be aimed at “pro” type card. I’ll take my down votes.

Reply

[-]

Astigi@reddit

There is no significant improved performance in RDNA 4, just another 16GB VRAM on XTX version. Nvidia has never been worried about AMD

Reply

[-]

TigerBalmES@reddit

I’ll definitely buy that over ngreedia products

Reply

[-]

EntertainmentMean611@reddit

Honestly... a slower core and give it 128g of vram .. even with slower vram and it would sell like nuts for local AI hosting.

Reply

[-]

Gronkers@reddit

Could finally load all those skyrim mods...

Reply

[-]

ea_man@reddit

This would make so much sense that AMD will never contemplate it, they would send wagons to the AI folks.

Reply

[-]

Doubleyoupee@reddit

Absent 9080xt confirmed. That's why they needed some more time!

Reply

[-]

SubstantialInside428@reddit

Hope not, I don't want buyer's remorse for taking the 9070XT

Reply

[-]

PalpitationKooky104@reddit

more remorse for a 5080 at1300

Reply

[-]

SubstantialInside428@reddit

Those buyers deserve hell

Reply

[-]

funix@reddit

So we're moving into an era where our GPUs will have more RAM than our motherboards.

Reply

[-]

Stennan@reddit

This doesn't sound like the worst idea AMD has come up with ("Poor Volta" would fit that description). DDR6 doesn't have that many customers, so 16GB should cost around 40-50$. Add a bit more for a sandwiched PCB, and AMD could launch a 32GB "9070" below 1000$.

Reply

[-]

Jeep-Eep@reddit

We'll probably see similar with 16 gig 9060s and the like.

Reply

[-]

SnowZzInJuly@reddit

And it’ll have all the same inventory issues all high end cpu parts do now. You won’t get one until 2 years later. Enjoy 😊

Reply

[-]

Fer-Butterscotch@reddit

Eh, it's a bit niche. Paying more for no appreciable extra performance in most gaming scenarios seems like it'll keep demand down among people who won't legitimately utilise the extra ram. Or sheeple will sheeple and buy it to get high huffing the spec sheet. One of the two.

Reply

[-]

Jeep-Eep@reddit

I mean, I'd go out of my way to grab it because radeon double VRAM SKUs have a long history of being enduring performers.

Reply

[-]

MiloIsTheBest@reddit

I would definitely huff that spec sheet. I don't *quite* feel like 32gb is completely necessary but I certainly wanted *something* to offer more than 16gb this generation without costing $5000AUD A 24GB 5080 would've suited me just fine

Reply

[-]

Turtvaiz@reddit

> A 24GB 5080 would've suited me just fine That seems rather likely to be releasing later on as a super model

Reply

[-]

Fer-Butterscotch@reddit

Yeah, look, I'd buy one just to throw some weird shit at it and see how it handles it, but it wouldn't be games I don't think. I don't know too much, but my impression was that 16gb is plenty for 4k, and I don't think the compute power is there for 4k triple screen in any real way? Not sure tho, happy to be proven wrong.

Reply

[-]

MiloIsTheBest@reddit

How much is plenty depends on the game and what assets it would want to load. The more detailed and more expansive games are the more they require regardless of resolution. I want something that will not have an issue storing assets for quite some time. And as far as I'm concerned 16GB today is 8GB 5 years ago. Maybe handles pretty much everything *now*, but this thing's gotta last me a couple of generations. For me, I like to Flight Sim (for the scenery, I'm a filthy casual) and I play a few games which, thanks to an under provisioned GPU, run afoul of VRAM long before the compute performance is too low. For my next card, too much is far preferable to 'probably enough', and none of the regular cards have been announced with anything more than "probably enough".

Reply

[-]

NGGKroze@reddit

Those while gamers try to buy them will be scalped or brough for LLM use due to high VRAM.

Reply

[-]

Fer-Butterscotch@reddit

My point is that there are very very few gamers who have a reason to buy this just for gaming. And that's a good thing. It's a niche product and I'd love to see it get made.

Reply

[-]

Visible_Witness_884@reddit

2 years later and at higher price than when it launched.

Reply

[-]

ButtPlugForPM@reddit

Hopefully they come out swinging with UDNA give something thats like 60 percent on top of a 7900xt.. but go really off book,and do multichiplet,and give it HBM for crazy bandwidth..

Reply

[-]

ga_st@reddit

Yes please. This, and unusable 150x multi frame gen just to fuck with Nvidia's "performance" graphs. Please AMD.

Reply

[-]

reddit_equals_censor@reddit

i do want to see a meme marketing slide, that shows a 16x fake interpolated frame gen mode. laughing it up at nvidia's bs and then moving on to ACTUAL performance in the next slide. moore's law is dead made a lil short showing sth like that: [https://www.youtube.com/shorts/XYWPKbbwS18](https://www.youtube.com/shorts/XYWPKbbwS18) but that would be clever marketing, so we won't see that from amd ;)

Reply

[-]

ga_st@reddit

I see, I don't really follow or watch anything from the guy, but I guess he making the same joke is the reason why my post is being downvoted. That, or people who really like to be shafted by alligator leather jacket man. So many on this sub. Some people just like the whip, you know? And jacked boy knows his customers very well, so whip it is. There are people ITT doing the *how can she slap face* because they can't fathom the possibility of a 70/80 class GPU having 32GB of VRAM in big 2025. Those people have been scarred by Nvidia, it's so sad to watch. In any case yea, if I were AMD I'd really do that. After that, I'd go on a divergent path, do my own thing compared to Nvidia, advance graphics, sustainably, together with the community and partners, which are plenty and influential. AMD owns the whole console and handheld market, they should really stop chasing Nvidia's shenanigans. Those saying AMD should follow Nvidia are monumental morons.

Reply

[-]

Ch1kuwa@reddit

Double the VRAM, double the idle power consumption on a multi-monitor setup.

Reply

[-]

Reggitor360@reddit

So like the 3090/5080/90 did since launch and no one cared about the 90-125W idle?

Reply

[-]

AreYouAWiiizard@reddit

Maybe they realized they can add a heap of VRAM and dump a heap of power into the 9070XT and sell it as a 9080 for extra profit since the 5080 showed a disappointing increase?

Reply

[-]

GenZia@reddit

Since I'm no AI/Llama/DeepSeek expert (to put it mildly), would anyone mind explaining exactly how reliant these LLM models are on bandwidth? If all they need is VRAM and don't depend heavily on bandwidth (or at least don’t require a ton of it), then it might not be the worst idea to explore LPDDR-X instead of the more traditional GDDR or HBM. For example, Samsung currently offers 128Gb @ 32-bit LPDDR5X running at 7.5 Gbps. That means you could have up to 128GB on a 256-bit wide bus, though the bandwidth would be roughly comparable to something like a GTX 1070 Ti (8 Gbps GDDR5X). That said, I’m not entirely sure if 128GB would be enough to run the full-blown 236B parameter model locally at 4-bit.

Reply

[-]

noiserr@reddit

Bandwidth is important. 9070xt has 624.1 GB/s bandwidth which is nothing to sneeze at. It's slower than 3090, 7900xtx but not by much. To put it in perspective this is still twice as much bandwidth that Strix Halo or Nvidia's DIGITS.

Reply

[-]

koolaidismything@reddit

Even low end GPUs mostly have more RAM than my whole system. I need to go look at a modern AAA game. I don’t game the last one I saw and was like woah was Arma maybe.. like 2017

Reply

[-]

lordofthedrones@reddit

Compute card but gaming pricing? Please, please be real.

Reply

[-]

_Restless_Spirit_@reddit

Oh my god, yes please. With local LLMs, the more VRAM the better.

Reply

[-]

moschles@reddit

What software can utilize AMD GPUs to train LLMs? (Unless you meant local inference? )

Reply

[-]

noiserr@reddit

You aren't going to train models on desktop GPUs. You usually need much more compute and datacenter GPUs for that. You can fine tune on Radeon using qLoRa though. And yes inference.

Reply

[-]

noiserr@reddit

If true this is great news for local llama enthusiasts. Literally what I've been waiting for.

Reply

[-]

noiserr@reddit

This is all I ask for. Fuck yes.

Reply

[-]

PeakBrave8235@reddit

Meanwhile M4 Max is 3090 level desktop performance with 128 GB of graphics memory.

Reply

[-]

Fer-Butterscotch@reddit

What's the cheapest you can get an M4 Max?

Reply

[-]

PeakBrave8235@reddit

Sounds like a rhetorical question so you should try to make your point you want to make

Reply

[-]

Fer-Butterscotch@reddit

Honestly asking, I dont know Macs. There earnt many options for 32gb cards, so even a few grand is not really insane.

Reply

[-]

PeakBrave8235@reddit

Oh gotcha! Well in desktop form it’s likely to start at $2,000 USD. The current M2 Max desktop model starts at $2,000 for 32 GB.

Reply

[-]

Fer-Butterscotch@reddit

OK, that's pretty impressive since I think you're currently looking at the workstation type cards which run thousands of USD to get 32 gb and decent GPU power. Good call.

Reply

[-]

PeakBrave8235@reddit

I agree totally! The rumored Apple M4U chip will likely be on the level of a 4090 Desktop GPU, which is based on past Blender results and extrapolation And it’s also rumored to have 256 GB of memory. So you cohld have 4090 levels of power with 256 GB of graphics memory. And because the memory is unified (and not shared), it means you truly have access to all that memory for anything you want to do!

Reply

[-]

Fer-Butterscotch@reddit

Freaking ay, that's what having design control of the end to end platform and a ready made consummer base will do for you! I gotta admit, I was puzzled when Apple said they were going to bring chip making in-house, but it was a 100% genius move.

Reply

[-]

PeakBrave8235@reddit

I agree! Exactly! One more thing. I’ve seen some videos of an Apple MLX Engineer (MLX and Metal are most equivalent to “CUDA”) testing those Large transformer models, and he was able to connect 3 M2U chips with 192 GB of memory each, and while running a LLM machine learning model, the whole thing only took 100 watts of energy at its peak, with the average being 60 watts for all 3 Macs together.

Reply

[-]

Fer-Butterscotch@reddit

Yeah, I'd heard about that idea of interconnecting them, but I'd forgotten about it. Very cool, exciting times.

Reply

[-]

szczszqweqwe@reddit

So a 4070s performance with lots of memory, but it's a shared memory, not GPU only memory, still at 128GB it doesn't make much difference in most cases.

Reply

[-]

PeakBrave8235@reddit

It absolutely makes a difference lol. The memory isn’t “shared,” rather it’s stored in one place. Instead of one pool of memory being split between the CPU and GPU, all data is stored in the memory and can be simultaneously accessed and manipulated by both the CPU and GPU. It’s not like shared memory models such as Intel’s iGPU

Reply

[-]

szczszqweqwe@reddit

You are right, I meant a single pool of memory for both without any division. I haven't got iGPU for a while and it definitely shows.

Reply

[-]

PeakBrave8235@reddit

Well traditional iGPUs use shared memory, which is taking a portion of memory and allocating it. The CPU or GPU cannot directly manipulate it without first having to copy the data and send it to either (example CPU wants to manipulate data that the GPU is working on, it needs to first copy that data) Whereas the unified model is one single memory pool where both the CPU and GPU can access data without needing to copy it first. It speeds stuff up and saves on power, while also providing large amounts of memory. LPDDR has lower bandwidth than GDDR or HBM, but the benefit is 1) large amounts of it, 2) low energy costs, 3) not as expensive.

Reply

[-]

Kionera@reddit

AMD has Strix Halo with 96GB of shared memory, there are several mini desktops being made around it.

Reply

[-]

PeakBrave8235@reddit

Yes that’s true, but Strix Halo doesn’t match the 3090 desktop in performance.

Reply

[-]

AutoModerator@reddit

Hello HLumin! Please **double check that this submission is original reporting and is not an unverified rumor or repost** that does not rise to the standards of /r/hardware. If this link is reporting on the work of another site/source or is an unverified rumor, please delete this submission. If this warning is in error, please report this comment and we will remove it. *I am a bot, and this action was performed automatically. Please [contact the moderators of this subreddit](/message/compose/?to=/r/hardware) if you have any questions or concerns.*

Reply