TheaterFire

AMD reportedly working on gaming Radeon RX 9000 GPU with 32GB memory

Posted by HLumin@reddit | hardware | View on Reddit | 261 comments

Reply to Post

261 Comments

NGGKroze@reddit

If this is true, say bye bye to 599 9070XT 16GB. 599$ for 9070 799$ for 9070XT 999$ for 9070XT 32GB - you are getting 5090 VRAM, but for half the money (MSRP wise) while also being in performance between 4070Ti Super and 4080S. Problem with with such high VRAM, those will be scalped so hard so either you won't find any or you will find them, but lot higher than MSRP.
View on Reddit #48357171

reddit_equals_censor@reddit

>Problem with with such high VRAM, those will be scalped so hard so either you won't find any or you will find them, but lot higher than MSRP. mid size dies with dirt cheap gddr6 and basic packaging. and a not new node. amd can pump those out way beyond scalping possibility. i mean hell they added 2 more months of production already with the delay of the cards. there is also 0 uniqueness about the 32 GB version. same die clam shell and done, so amd could easily shift most of their production to the 32 GB version if they are vastly more in demand.
View on Reddit #48360188

Jeep-Eep@reddit

Yeah, RDNA 4 was designed with manufacturability and cost effect first in mind, in that order. They've acting like the console guys, not trying to catch nVidia this gen.
View on Reddit #48374944

reddit_equals_censor@reddit

>not trying to catch nVidia this gen. it is worth remembering, that amd has a long history of just making midrange cards every other generation even often. so it is nothing new. now arguably with how shit the 50 series is, the cancelled chiplet gddr7 monster high end gpu could be seen as a mistake, but who could have guessed, that the 50 series will be such a piece of shit/stand still. and in regards to catching nvidia, if the leaks, which we got plenty by now are true, then amd is on part or ahead in architecture. amd performance/cost of the whole card should be ahead of nvidia, as they are still using dirt cheap gddr6, instead of gddr7 even. raytracing performance being massively up and fsr4 upscaling being a big step forward. so architecturaly they seem to be at bare minimum on par with nvidia, which is very impressive. and that is the hard part. making a high end version of an architecture just in general is not the hard part. although in rdna4's specific case it would been uniquely harder, because instead of just a massively bigger monolithic die, they went insanely chiplet based with split cores (rdna3 chiplet is child's play in comparison)
View on Reddit #48418765

Jeep-Eep@reddit

Goddamn, when I called nVidia's AI obsession and its repercussions their 'intel node moment'... damn.
View on Reddit #48425424

Mnmemx@reddit

there are a lot of companies you can reasonably accuse of "ai obsession", but probably not the one that is "selling the pickaxes" so to speak who currently has a license to convert all available tsmc allocation directly into money because the industry has an insatiable demand for nvidia GPUs regardless of how gamers view the latest RTX cards
View on Reddit #48622567

Jeep-Eep@reddit

It is when it fucks up their ability to make other sorts of pickaxe.
View on Reddit #48622595

pyr0kid@reddit

they ARE the console guys, they design the parts for xbox/playstation
View on Reddit #48408221

Jeep-Eep@reddit

thatisthejoke.bat Yeah, and RDNA 4 seems to be them starting to more fully apply those learned lessons.
View on Reddit #48425030

Morningst4r@reddit

People said the same thing about RDNA2, yet there was hardly any supply when NVIDIA GPUs were going for 3x MSRP. I hope they do pull it off but I’m not holding my breath.
View on Reddit #48400597

noiserr@reddit

Why would the existence of a double vRAM model affect other models? I'm not following.
View on Reddit #48364258

PalpitationKooky104@reddit

It dont 499, 599, 749. Would work for 9070's
View on Reddit #48381264

NeroClaudius199907@reddit

Amd wouldnt dare. Youtubers & gamers will be very upset
View on Reddit #48358268

moschles@reddit

The world desperately needs someone to migrate Pytorch to run on AMD (sans CUDA)
View on Reddit #48361261

noiserr@reddit

This has been done for awhile now.
View on Reddit #48363582

Strazdas1@reddit

The issue is you need to use ROCm. Most times, you cant use ROCm because theres some fundamental issues AMD refuse to address.
View on Reddit #48445249

noiserr@reddit

Which version of ROCm have you used the last time? I've been doing a lot of pytorch development and it stuff just works.
View on Reddit #48449997

Strazdas1@reddit

I dont remeber the version number. It was a bit over a year ago. We ran into an issue we couldnt solve. There was no documentation about it. Contacting AMD lead to nothing (basically got told to fix it ourselves). Eventually department head decided that it would be cheaper to just switch to Nvidia instead of trying to fix ROCm.
View on Reddit #48531437

moschles@reddit

I'm hearing rumors that ROCm is "experimental" and nobody is seriously maintaining it.
View on Reddit #48364361

noiserr@reddit

ROCm has made a lot of progress in the past year. AMD has literally sold over $5B worth of datacenter GPUs in 2024. This is a serious effort.
View on Reddit #48364475

szczszqweqwe@reddit

Even if it's a 32GB 9070xt it would be a cool thing, especially if it's priced well. They can OC it and call it 9070xtx 32GB.
View on Reddit #48356480

Aquaticle000@reddit

But why a *9070*? I feel like something with that much VRAM would be reserved for like a 9090 or 9090xt, like the 7900xtx with 24GB and the 7900xt with 20GB. This just seems really out of place?
View on Reddit #48356733

MiloIsTheBest@reddit

The 9070 XT that's supposed to be positioned somewhere between the 20gb 7900XT and 24GB XTX? Frankly I'm annoyed it doesn't *already* have 24GB
View on Reddit #48357289

reddit_equals_censor@reddit

>Frankly I'm annoyed it doesn't *already* have 24GB i think price should define if 16 GB is fine or not for now. a 500 euro 9070 xt with 16 GB seems fine this generation. a 600 euro one.. meh. having a double vram edition for the memory difference cost only would make up for that though. the rx 480 4 GB and 8 GB did that and people loved that. choice instead of scams.
View on Reddit #48360049

Merdiso@reddit

Some of you are really delusional, 500 EUR would put it at 449$ MSRP tops, there's no way it will be that low, you're literally setting yourself for disappointments or just looking for excuses to buy nVIDIA.
View on Reddit #48376260

reddit_equals_censor@reddit

500 euros may be 500 us dollars, based on theft and inflation theft at the time. and worth keeping in mind, that 450 us dollars for a 9070 xt would still mean lots of profit for amd... just to keep that in mind. >or just looking for excuses to buy nVIDIA. that is kind of hard for me, as i won't put a fire hazard 12 pin device in my home, nor will i torture myself with nvidia's gnu + linux proprietary drivers, nor will i throw money at missing vram. so wrong person to put that suggestion up :D and disapointment is all but assured by amd's leadership and especially marketing team. <points at last few releases with bad reviews, bad prices followed by price decreases shortly after, so that amd can anjoy bad reviews with acceptable prices at the same time :D
View on Reddit #48419204

Strazdas1@reddit

>based on theft and inflation theft at the time. are you sure theft is the right word? it makes no sense.
View on Reddit #48444935

reddit_equals_censor@reddit

inflation is fully artificially created by physical or digital money printing, that may or may not have any backing. as the nuked value is now in the printed money and as that printed money is used by the kakistocracy and doesn't go the public, that is indeed theft! here is a great lil meme video with lovely music, that explains it in a very nice way ;) [https://www.youtube.com/watch?v=WEMCYBPUR00](https://www.youtube.com/watch?v=WEMCYBPUR00) the graph in the background is the value of the us dollar ;) and important to understand, that inflation is NOT inherent to any money system in society. any such idea is complete nonsense. we had tons of money systems without inflation in the past. to name just one commonly known one, directly value backed currency or direct value in the currency. as in gold backed currency or silver backed currency, or direct silver or gold coins with fixed weight. so yeah the added theft in the form of "tariffs" is far from the only theft from the government. but any more than that would e too far off from the hardware topic. but it makes sense to know when you look at hardware prices over 2 decades with money printers going brrrrrrrrrrrrrrr very hard to devalue YOUR money.
View on Reddit #48447025

Strazdas1@reddit

while money printing encourages inflation, thats not its only source. i understand your dislike for inflation, and we could discuss this somewher else as this isnt the right place if you want, but i dont agree it should be considered theft. And tariffs are only theft if you are one of those "all taxes are theft" people.
View on Reddit #48531322

Strazdas1@reddit

449 msrp tops is what it would need to be if they want to stop loosing market share.
View on Reddit #48444904

Merdiso@reddit

So basically half the price of 5070 Ti in the "real world" ? AMD is not charity, you know.
View on Reddit #48445102

Strazdas1@reddit

Half the features would require half the price. and AMD must bring a very good value proposition if it wants to gain market share.
View on Reddit #48447187

noiserr@reddit

16GB is fine for gamers. Offering a 32GB version for a little extra is perfect imo. People who need 32GB will easily pay an extra $100 or so for it.
View on Reddit #48364158

reddit_equals_censor@reddit

almost as if one shouldn't throw their naming over board to match the competition, that is already full of shit with their naming :D an 8800 XTX 32 G would for example slot in way better naming wise if they wanted to, but nah.... let's not do that :D gotta make sure, that no one has any idea what an amd card's name means even enthusiasts, while the competition, that owns the market has scamed people with names, but kept them consistent for years and years.... uh shit! what if amd already changes names completely before the 32 GB version of the 9070 xt comes out ;)
View on Reddit #48359958

RedPanda888@reddit

Remember Nvidia released the 4060ti with 16GB vram, which now is still equal to the 5080 vram. Helps throw a bone to people who don’t need top top tier gaming but need the vram at a lower cost.
View on Reddit #48358438

Jaznavav@reddit

Because it's their top die this generation, you can't go higher.
View on Reddit #48358332

szczszqweqwe@reddit

Well, as far as we know a bigger die doesn't exist, it will take a lot of time to resurrect it, as they abandoned it long time ago. If they want to release it soon-ish 9070xt and memory on a back of a PCB is probably the only way.
View on Reddit #48357371

SubstantialInside428@reddit

70/80/90 class are separated by their DIE class/design, not the RAM
View on Reddit #48357063

Jeep-Eep@reddit

We did see some power specs that suggest some aggro factory overclocks.
View on Reddit #48388418

FormalIllustrator5@reddit

Spoiler alert - will be using 12V-Nshitia cable..
View on Reddit #48357206

FormalIllustrator5@reddit

To all downvoters... https://videocardz.com/newz/sapphire-radeon-rx-9070-xt-nitro-pictures-reveal-hidden-16-pin-power-connector-design Lol...or you are just Nshitia fans?
View on Reddit #48530301

Reggitor360@reddit

Only Taichi, since AsRock doing AssRock moves agin
View on Reddit #48367802

reddit_equals_censor@reddit

if amd aren't idiots, then they will release a 9070 xt with 32 GB, that has only the memory cost difference added to the 9070 xt price. why? because the profit stays the same. it gives them a great vrm argument, especially if they ONLY release 16 GB cards and up this generation and it makes it look better to the 5080 insult and even the 5090, if you want a card to use for a long while or need more vram for other applications already. that would be the smart move, that would be the "gain market share" move. so amd marketing might fight tooth and nail from letting that happen :D
View on Reddit #48359503

Ascender4ever@reddit

This idea, is exactly spot on.  Hope greed doesn't get in the way of patience and sheer logic.  If I'm even considering buying the 32gb version after a solid no to the 16gb version due to pcie 4.0 rather than pcie 5.0 That! Is exactly what would do it for me and possibly countless others.
View on Reddit #48501848

Plank_With_A_Nail_In@reddit

So anyone who was in the market for a 9070 16GB is now going to skip it and wait for the 32GB version, well done AMD.
View on Reddit #48455655

Ascender4ever@reddit

I was going to get the 16gb version until rumors came out it was only pcie 4.0, that is what changed my mind.  32gb made me possibly think twice about waiting for pcie 5.0
View on Reddit #48499860

Ascender4ever@reddit

I believe, they would want as many people to be happy as possible.  No, it will not cut the amount of devs, yes, it would indeed bring more people to AMD. If the new 9000 series is indeed pcie 4.0 and 32gb vram I am now tempted to get one even though I was waiting for pcie 5.0 simply because of how much Vram is stated as being placed within the card.  No, developers will always choose what works best for them either NVIDIA or AMD or Intel no matter what gamers get it would only help the developers bring in more cash as it would open the door for one reason more a game should cost more.  They aren't going to care if consumers have a card as powerful as theirs that just means a finer product.  Inflation will indeed happen eventually, hopefully so will paychecks, but without some reason to build a game we will never have them as without this new ram capacity there would be no need.
View on Reddit #48498713

Ascender4ever@reddit

P.s. while the developer market is needed.. having a program for seasoned developers to receive a discount or free cards seeing that development is like what? No more than 1-10% of the market share (I could be wrong I'm guessing) but that way through a special program they would get a free card so long as they optimized their games for AMD would be a wise move.  Also, if you like the company you buy video cards from solely for the sake of buying from that company, just beware of burning cables (speaking about 12VHPWR).  So have fun and beware of all the ups and downs of every company.
View on Reddit #48499493

From-UoM@reddit

Its almost certainly the Radeon Pro Line which has 2x memory with double sided vram Here is the W7900 (70990xtx) with 48 GB [https://www.amd.com/en/products/graphics/workstations/radeon-pro/w7900.html](https://www.amd.com/en/products/graphics/workstations/radeon-pro/w7900.html)
View on Reddit #48357002

mckirkus@reddit

Hard disagree. You can buy a 16GB 7600xt for $300. The cost of doubling the VRAM to 32GB can't be that much more than the cost of the entire card. Maybe I'm not factoring in bus width, or different, more dense chips, but my kids old 3060 has 12GB. For AI, and rendering tasks, just having enough RAM is the most important barrier, how fast it works after that is important but not like that first hurdle. For what I do, CFD can use two GPUs (FluidX3D with OpenCL), and Ollama for AI, BOTH can use more than one GPU and it's additive. So getting two GPUs that total 64GB for around $2k would change everything.
View on Reddit #48367429

From-UoM@reddit

This is not a gaming card. Or else they could have have launched this too with Gaming rdna3 32 GB but didn't The 32 GB based on rdna3 exists btw https://www.amd.com/en/products/graphics/workstations/radeon-pro/w7800.html
View on Reddit #48367769

mckirkus@reddit

Your argument is that because they haven't released it in the past, they will never release one? 24GB consumer cards exist, but 32GB is impossible?
View on Reddit #48368535

From-UoM@reddit

24GB exists because of the 384 bit bus 384 bit bus allowed for the 48 GB pro card. Nvidia does the same. 24 GB of the 4090 and the L40/RTX 6000 Ada has 48 The 5090 will get a pro card 64 GB and possiblly 96 GB variant. The 5080 will get turned into the 32 GB pro card. Pro cards will get the full future set including fully unlocked fp16 perf (on atleast Nvidia cards) with additional software and enterprise support.
View on Reddit #48369195

HilLiedTroopsDied@reddit

valid points but there is such a thing as 3gb chips instead of 2.
View on Reddit #48483542

ResponsibleJudge3172@reddit

And why would you assume they will do what they have never done, using knowledge that they have always had pro cards with double sided VRAM? Why would they cannibalise their pro lineup?
View on Reddit #48369919

PalpitationKooky104@reddit

To support a 5b and growing ai business, and at the same time take gaming market share
View on Reddit #48378327

mckirkus@reddit

Because it would also cannibalize Nvidia's pro lineup.
View on Reddit #48377898

Vb_33@reddit

It is a gaming card the post has been edited as the leaker confirmed it. 
View on Reddit #48379449

Tuna-Fish2@reddit

32GB Navi48 would not have a different bus width or denser chips, but just clamshell ram. The spot price for 16Gbit GDDR6 modules is ~$8, add some premium for the top speed grade and a bit more for a more expensive PCB, the cost add to AMD would certainly be below $80.
View on Reddit #48369072

PalpitationKooky104@reddit

so 499 for 9070, 599 for 9070xt, 749 for 9070xtx 32gb. Who would buy 5070ti or 5080?
View on Reddit #48378665

ResponsibleJudge3172@reddit

Same with 48GB Navi31 workstation and 48GB rtx 4090 based workstation cards
View on Reddit #48369968

Tuna-Fish2@reddit

Yep. I made no claim on the price difference between the card with doubled ram config and base, only clarified the cost difference. AMD always gets much higher margins from Pro-grade stuff.
View on Reddit #48370162

JakeTappersCat@reddit

It says "gaming" explicitly. So, are you saying they're lying?
View on Reddit #48369547

Strazdas1@reddit

its AMD marketing department, what do you think?
View on Reddit #48444645

the_dude_that_faps@reddit

Sadly, this is what I think it is. I doubt they would cannibalize their professional SKUs with a consumer 32 GB GPU.
View on Reddit #48357725

liaminwales@reddit

May be a second Radeon VII, failed pro cards? [https://www.techpowerup.com/gpu-specs/radeon-vii.c3358](https://www.techpowerup.com/gpu-specs/radeon-vii.c3358) Always wanted one, just there failure rates seemed a tad high.
View on Reddit #48410092

the_dude_that_faps@reddit

I would've bought one if they made sense vs the competition. But then again, I had a 1080ti back then. It would've been a sidegrade and the crypto boom made them unubtanium.
View on Reddit #48417341

HilLiedTroopsDied@reddit

how would amd cannibalize a market which hardly even exists for them? hah. Just give us huge RAM, we won't buy their pro line up anyways
View on Reddit #48403921

the_dude_that_faps@reddit

I'm all for it, and would pay extra as long as the extra was reasonable. But it's AMD, if it makes sense for consumers, they probably won't do it.  They're not the same company that brought us the first few Ryzen CPUs.
View on Reddit #48413345

PorchettaM@reddit

Considering the state of ROCm, sacrificing some of their Pro margins in the short term to bring more devs into their ecosystem would probably be a smart move. Which means AMD is unlikely to do it.
View on Reddit #48358370

kontis@reddit

It would be a bold and desperately needed move by AMD. It would be incredible if true. This would have the greatest chance of destroying CUDA moat. Tons of GitHub projects would try to support this card, by pushing for agnostic alternatives like TinyGrad. Heck, I'm in the "CUDA is required for my work" camp and I would buy this card in a heartbeat.
View on Reddit #48361281

HotRoderX@reddit

I doubt it, the reason being... simple AMD has a track record of picking the ball up and dropping it with in 1-2 generations. There going to need to prove there reliable first. Otherwise why put in the work and effort. Nividia if nothing else is reliable you know there going to continue down the same path and innovate. Where AMD might have a brain fart decide to go branch off in a completely unrelated direction while completely dropping what they were doing and ignoring it.
View on Reddit #48404713

Extra-Advisor7354@reddit

That’s a delusional take .
View on Reddit #48377869

Banned_Dont_Care@reddit

>That’s a delusional take . Oh, would you let the boy dream!
View on Reddit #48390319

topazsparrow@reddit

> would probably be a smart move. Which means AMD is unlikely to do it. LOL. 💀
View on Reddit #48385838

Apprehensive-Buy3340@reddit

It is their stated objective to unify the software stack so every consumer can also be a dev, almost 20 years after nvidia did the same with CUDA.
View on Reddit #48359671

ProperCollar-@reddit

That's what Vega and VII were. It's not AMD is trying this for the first time.
View on Reddit #48378693

BiomassDenial@reddit

Basically, the 9000 series is a placeholder and the last of RDNA cards and whatever they do next will be on their new UDNA architecture.
View on Reddit #48366230

Jonny_H@reddit

It was unified back when OpenCL and co. were the focus, that just fell apart and not maintained as the focus changed. Also about the same time AMD nearly went bankrupt, if we're looking at reasons *why* that might be :P
View on Reddit #48365091

acc_agg@reddit

>Which means AMD is unlikely to do it. Beat me to it lol.
View on Reddit #48361015

PalpitationKooky104@reddit

so fake story? Or you just bash amd for fun?
View on Reddit #48377966

ExtendedDeadline@reddit

This hits hard :(.
View on Reddit #48366196

lordofthedrones@reddit

I want to dream
View on Reddit #48362174

Gansaru87@reddit

I just bought a 7900xtx, it better fuckin be a workstation gpu.
View on Reddit #48379872

simplyh@reddit

> W7900 $5k?? What is this, an NVIDIA GPU?
View on Reddit #48376207

Typical-Common3955@reddit

That.... Makes as much sense as the 4060 ti 16gb version... *Le sigh*
View on Reddit #48426981

AssholeFramed@reddit

Jokes on you, the 4060 ti 16gb does actually make sense.
View on Reddit #48478299

OafishWither66@reddit

9070XT with 32gb vram?
View on Reddit #48356200

FuturePastNow@reddit

They *could* use 3GB chips to make a 24GB model. It wouldn't have enough memory bandwidth but that's never stopped GPU makers in the past.
View on Reddit #48386833

pmth@reddit

How are you able to figure out that that the memory bandwidth wouldn’t be enough to make use of 24gb? I’m not sure how the calculations work.
View on Reddit #48415758

FuturePastNow@reddit

It depends on the workload I suppose. AI stuff apparently only needs capacity rather than bandwidth so if that is the target market maybe it will be fine. But with GDDR6 the 9070s will have 640GB/s of memory bandwidth, compare that to the 5090's *1792*GB/s with GDDR7 and double the memory bus width. This is only part of the equation for a card's performance, but games won't benefit much from just doubling the RAM on a card like this.
View on Reddit #48416279

Strazdas1@reddit

AI workloads are bandwidth starved almost universally.
View on Reddit #48444806

Plank_With_A_Nail_In@reddit

If the model you want to run doesn't fit in VRAM then bandwidth don't mean shit as you are stuck using the CPU and performance will be awful. More VRAM is always better for AI workloads.
View on Reddit #48455884

Plank_With_A_Nail_In@reddit

He's also assuming gaming is the only use of GPU's.
View on Reddit #48455787

Strazdas1@reddit

They are using GDDR6 for this gen, so no 3 GB chips for AMD.
View on Reddit #48444778

Kionera@reddit

This could be the 5090 killer for AI tasks, but knowing AMD they're probably gonna fuck something up.
View on Reddit #48357126

Tiny-Independent273@reddit

if it is 9070 XT then it's just the extra VRAM so I wouldn't expect it to be as powerful for AI
View on Reddit #48357511

Jaznavav@reddit

Speed doesn't really matter for hobbyist AI tasks, VRAM is much bigger barrier.
View on Reddit #48358150

Strazdas1@reddit

Both are important.
View on Reddit #48444742

Terrh@reddit

You can buy a dirt cheap 16gb card already from amd, MI series or a Vega FE. There are 32gb MI series cards as well.
View on Reddit #48367564

Jaznavav@reddit

Sure. Can it game?
View on Reddit #48367831

Terrh@reddit

Mine still does ok for being as old as it is.
View on Reddit #48368543

bill_cipher1996@reddit

That DeepSeek model just needs a lot of VRAM, performance wise it runs great on even older AMD Hardware.
View on Reddit #48357581

Mean-Professiontruth@reddit

It needs hundreds of vram..
View on Reddit #48358208

F9-0021@reddit

The big models, sure. But there are smaller models that are less generalized or weaker that will fit in consumer amounts of VRAM.
View on Reddit #48393563

ea_man@reddit

[https://www.reddit.com/r/selfhosted/comments/1ic8zil/yes\_you\_can\_run\_deepseekr1\_locally\_on\_your\_device/](https://www.reddit.com/r/selfhosted/comments/1ic8zil/yes_you_can_run_deepseekr1_locally_on_your_device/)
View on Reddit #48387487

callanrocks@reddit

Run it on an old server with a TB of the stuff.
View on Reddit #48363046

Apprehensive-Buy3340@reddit

Yes, you'd need multiple cards whatever you buy, so you'd want the cheapest vram/price ratio card that sums up to R1's total size within whatever PCIe slots you can get (mining cards got to 8 slots I think?)
View on Reddit #48359602

gurugabrielpradipaka@reddit

Yep, with my 6900XT I can just use the model with 20B parameters. 32GB would be fantastic 😻
View on Reddit #48363605

Kionera@reddit

It still competes with the 5090 in being able to run larger models.
View on Reddit #48357932

sheldonrong@reddit

$799 I will buy it for sure for AI!
View on Reddit #48360461

PalpitationKooky104@reddit

Could be 749.
View on Reddit #48379043

NeroClaudius199907@reddit

Like 7900xtx and 4090s?
View on Reddit #48358172

noiserr@reddit

7900xtx is great for LLMs. It's not as fast as the 4090 but it's not that slow either. For half the price it's a much better option actually.
View on Reddit #48361685

Kionera@reddit

Those have 8GB less VRAM, and the 4090 costs way more. The 9070XT should also outperform the 7900XTX in AI tasks based on all the AI upgrades to Radeon driver features for RDNA4.
View on Reddit #48359213

SomniumOv@reddit

> but knowing AMD they're probably gonna fuck something up. Watch it not support ROCm. Or get a mere 2 years of support.
View on Reddit #48358082

iamthewhatt@reddit

It will no doubt "support" ROCm, but to what extent is entirely up to AMD's dev team. If they put even half the effort into the GPU software stack that nVidia does, they would be far more competitive.
View on Reddit #48369313

Strazdas1@reddit

Given how muc heffort into ROCm they put so far, id say thats highly unlikely.
View on Reddit #48444713

FinBenton@reddit

Only for some spesific things, every new open source tool and library you see coming out is build for cuda but if you are just infering a regular LLM then it should work.
View on Reddit #48360380

noiserr@reddit

Been using my 7900xtx for LLMs for over a year. And it has been smooth sailing for me. I would definitely get two of these GPUs if they were priced for consumers as vRAM capacity is a major bottleneck.
View on Reddit #48363269

sandor2@reddit

should work good on linux, if its supported by rocm
View on Reddit #48363265

Kryohi@reddit

The big question is whether it will be called 9070XTXX or 9070 AI XT
View on Reddit #48373631

GenericUser1983@reddit

Radeon AI 9070 XT 32 GB AI Edition.
View on Reddit #48381801

PalpitationKooky104@reddit

or bob? Doesnt mater. Or 9070xai?
View on Reddit #48379326

mokkat@reddit

A 9070xt with 32gb VRAM is a good idea. It caters to the crowd who needs it for work, as well as enthusiasts who want to pay more for the "best" card. They can probably earn more % profit since it doesn't need to be great value like the regular 9070xt. A "gaming" 9070xt with 32gb VRAM is stupid as heck. Not even Nvidia can put 32gb on a card and call it "gaming"
View on Reddit #48380387

Farren246@reddit

Entirely unnecessary for a mid-range card, but there might be some buyers. If you actually need 32GB you'd want a high core count to go along with it... and most people would probably want CUDA to go along with it too.
View on Reddit #48358826

HLumin@reddit (OP)

Nah, prolly a 9090 XTX or something
View on Reddit #48356272

OafishWither66@reddit

they aren't gonna make a flagship card this gen
View on Reddit #48356347

iothomas@reddit

It is alleged that when they realised that the 9070 was closer than they expected to the 5080 they pivoted to a 9080 32gb idea that could be just a few % above the 5080
View on Reddit #48358452

Jaegs@reddit

Could just be one that they want to market to people looking to run LLMs locally too,  double the memory is super useful for getting the higher parameter models to run.
View on Reddit #48356386

animeman59@reddit

9070 XTraRAM
View on Reddit #48357212

Krotiuz@reddit

Apparently confirmed by the leaker at the end of the article, so yes
View on Reddit #48356534

hitsujiTMO@reddit

I'm going to take these with a pinch of salt. Why are we hearing that AMD aren't going to compete in the high end market, yet still hundreds of rumours of high end cards?
View on Reddit #48356454

Aquaticle000@reddit

My question is, why a 9070 with that much ram specially? I’d expect that amount of VRAM for something like a 9090 or a 9090xt, which we clearly aren’t getting. At least not anytime soon.
View on Reddit #48356636

Strazdas1@reddit

9070 is the highest tier card AMD will produce this generation.
View on Reddit #48445085

Aquaticle000@reddit

That seems to be the trend so far, I’m aware. It was an example.
View on Reddit #48445460

szczszqweqwe@reddit

AI in general, some productivity tasks and game mods.
View on Reddit #48358441

Aquaticle000@reddit

This is obviously not a gaming GPU, which is why I questioned it. I don’t know why the leak is calling it one, it’s definitely not.
View on Reddit #48426343

kontis@reddit

If someone is making cinematics/product videos in UE5 editor or doing some AI this would be an amazing card. Complex CG scenes (no game-like optimization) in UE5 editor can even crash a 24 GB card. AI models don't even start below VRAM requirement.
View on Reddit #48361609

Aquaticle000@reddit

Now see this makes sense, but the leak claims It’s a gaming GPU, which is why I called the naming into question, it didn’t make sense.
View on Reddit #48426246

Jeep-Eep@reddit

Buy that fucker and coast across most of the refresh and next console gen.
View on Reddit #48389908

the_dude_that_faps@reddit

It's niche, but it gives people another reason to choose this over Nvidia.
View on Reddit #48357582

TheAgentOfTheNine@reddit

LLMs. If you offer a lot of ram, even with a low performance, they'll fly off the shelfs.
View on Reddit #48356743

szczszqweqwe@reddit

Why? They can just clamp another 16GB on a back of a 9070xt's PCB, do a slight OC and call it 9070xtx.
View on Reddit #48356707

hitsujiTMO@reddit

You still need to be able to interface all the chips with the soc. That alone means a much larger soc for the extra pinouts. So they would have a whole separate chip for this one card that costs them a whole lot more and they get fewer chips per wafer. I don't buy it. If they did this it would have to be a more performance chip to make it worthwhile. It would be a 9080 or 9090 not a 9070.
View on Reddit #48357638

Strazdas1@reddit

if you do clampshell you retain same bus width and same bandwidth, just with larger storage and significantly higher costs.
View on Reddit #48445044

szczszqweqwe@reddit

Not when they put half of memory on a back of PCB. 4060ti and 4060ti 16GB have a die with exactly the same area: 188mm^(2) [https://www.techpowerup.com/gpu-specs/geforce-rtx-4060-ti-8-gb.c3890](https://www.techpowerup.com/gpu-specs/geforce-rtx-4060-ti-8-gb.c3890) [https://www.techpowerup.com/gpu-specs/geforce-rtx-4060-ti-16-gb.c4155](https://www.techpowerup.com/gpu-specs/geforce-rtx-4060-ti-16-gb.c4155)
View on Reddit #48358349

Hayden247@reddit

The RX 7600 XT vs RX 7600 did the exact same thing from AMD too. Tho the XT was an OC as well so that's how they slapped the XT on. It'd make sense for AMD to clamshell the 9070 XT and make it 32GB. I'd love the 9070 XTX name for funises but apparently just 32GB, fine it is accurate. Some people wanted more than 16GB so it'll find a market even if 700USD. I mean it would be 32GB on the "cheap" since the RTX 5090 is the other 32GB GPU for well over double the price. Near triple probably.
View on Reddit #48365913

szczszqweqwe@reddit

Yeah, I might be one of a customers.
View on Reddit #48366314

Vushivushi@reddit

No it's called clamshell mode where you have two 32-bit memory chips sharing the same 32-bit memory controller. You don't need a new chip with a larger memory bus. You get more capacity at the same bandwidth with this configuration.
View on Reddit #48359075

Numerlor@reddit

clamshell doesn't need additional IO
View on Reddit #48358001

Jeep-Eep@reddit

And honestly, that thing would go for fuckin' years, so it would probably honestly be worth.
View on Reddit #48374719

tobimai@reddit

Could just be a mid card with a ton of memory. Especialy for LLMs interesting
View on Reddit #48387320

moschles@reddit

Article literally says "Tagged as rumor"
View on Reddit #48367756

Visible_Witness_884@reddit

Is it highend with 32gb and a regular core?
View on Reddit #48356631

hitsujiTMO@reddit

Fair enough, didn't see the update that it's just a 9070 xt. But I still don't believe it. There's no 4gb chips so they'd have to have a large enough soc to be able to pin out double the amount of chips and have its own board design with half the chips on the back of the board giving it a 512bit bus. If they said 24gb variant, yeah. But a 32gb variant could only be a professional card.
View on Reddit #48357049

Visible_Witness_884@reddit

Yeah... doesn't seem like the greatest product for gamers. I guess we'll have to wait till next gen from both nVidia and AMD to upgrade if we are already anywhere north of a 7800XT / 4070 in performance.
View on Reddit #48357411

reddit_equals_censor@reddit

should be a perfectly great product for gamers if amd just ads the vram price difference on top of the 9070 xt 16 GB version. assuming the 9070 xt 16 GB is priced aggressively, a 32 GB version would be great value short and especially longterm. 16 GB of more gddr6, that is in the worst case 60 us dollars more. probably a bunch less for amd. if amd charges 500 us dollars or 600 us dollars for the 9070 xt 16 GB and then charges 660 us dollars or 560 us dollars for the 32 GB version, people would buy them without question. just like they bought the 8 GB version of the polaris 10 cards (rx 480 for example)
View on Reddit #48359762

Visible_Witness_884@reddit

32 gigs is just a bit useless on a semi-midrange card if it comes with a price premium. Like, if they made a 32gig 5070 - it'd just cost more and not perform better.
View on Reddit #48361632

reddit_equals_censor@reddit

just give people the choice and let the market decide. i want my next card to have at least 32 GB vram. they can have the market decide and sell more graphics cards like they did with the 4 and 8 GB polaris 10 cards. >if they made a 32gig 5070 the 5070 with 18 or 24 GB would perform better though, because it has only 12 GB vram and with raytracing you can get into scenarios where 12 GB vram becomes an issue. nvidia even straight up told hardware unboxed, that yeah 12 GB vram won't be enough in some scenarios. nvidia sees the 5070 as the card to NOT get. the compromised card. DELIBERATELY COMPROMISED. so yeah a 24 GB (clam shell) or an 18 GB (1.5 x capacity one sided) OR a 36 GB version would perform better than the 12 GB insult already in some scenarios according to nvidia themselves when talking to reviewers... and if you automatically thought, that SURELY the 5070 has 16 GB and not 12 GB, it shows what a middle finger the 12 GB on that card is yet again. and people who might buy a 9070 xt with 32 GB might use their card or be able to use it for the entire ps6 generation so far into the future. yes lowered other settings, but max texture quality and still working just fine. maybe the ps6 will lead to 24 GB vram used at high quality settings, so bye bye 16 GB vram at high quality settings by then. and cards are used for that long if they still work. people have used (some are still using) polaris 10 cards with 8 GB vram. 8 GB vram is already a major issue, but the 4 GB version said good bye way way way way before the 8 GB vram versions. btw worth keeping in mind, that both nvidia and amd could leave it up to partners to configure the memory and let the market decide. partner putting 16 GB on a 4060 non ti for example would vastly outself the 8 GB insult if the difference in price would be just the 8 GB added memory. 24 GB or 18 GB 5070 would vastly outsell the 12 GB insult we can expect. especially nvidia does NOT want customers to have the choice on vram. it is NOT a hardware limitation at all even at a bs tiny bus. they just want to scam people and massively upsell people and have planned obsolescence. people on 12 GB vram will probably be heavily inclined to upgrade once the ps6 comes out and the first ps6 only games come out. while 18 or 24 GB version people would sit on it for 2 more years possibly as it still runs all games decently fine, if it would exist...
View on Reddit #48365624

Visible_Witness_884@reddit

The increase in VRAM over what is used makes no performance gain. There might be a slight gain from having a higher bus width. But if there's no use in it, it doesn't matter.
View on Reddit #48365886

reddit_equals_censor@reddit

>There might be a slight gain from having a higher bus width. the examples mentioned had no bandwidth increases. 1.5 x capacity or clam shell, or 1.5x capacity memory chips + clam shell all have the same memory bandwidth and bus. and again nvidia straight up told told tech reviewers, that yeah the 12 GB card will already have issues at launch in some games and that it is the compromised version. "why aren't you getting the working 16 GB 5070 ti, you poor pleb" is the nvidia mentality. and a disgusting mentality at that. so 16 GB over 12 GB matters according to nvidia!!! even.
View on Reddit #48366075

Visible_Witness_884@reddit

Well, yes, if you want to play all the games with full RT and at max quality. There are loads of games where it won't matter. But I totally agree that nvidia are putting out cards with too low vram for the performance potential. But that's the thing - having absurd VRAM without the performance potential to utilize it? That's just waste of resources.
View on Reddit #48366312

kontis@reddit

They could easily add $200 premium and it would still sell like hot cakes even if it's much slower than 5080. There is no other product like this on the market. Even for a gamer it's super cool to have so much headroom in VRAM.
View on Reddit #48361466

reddit_equals_censor@reddit

they probably could, but i'd argue they shouldn't. they claim to want to gain marketshare that is a great way, while keeping profits the same as the 16 GB card, which will still be great profits at a 500 us dollar price point for the 9070 xt 16 GB. also would be a way to get a ton of gamers away from nvidia. a bit slower than a 5080 for vastly less money with double the vram. that is a deal to take. that is an exciting product. that is sth, that gets people to switch to amd, especially those, who got burned by an 8 GB 3070 ti already for example. and charging a 200 euro/us dollar premium for 60 us dollar (they pay probs a bunch less) for the added 16 GB vram would feel disgusting and probs get pooped on reviewers as well. people would rightfully compare it to the 4060 ti 16 GB scam pricing, that charged 100 us dollars more just to add 8 GB vram and make it a working card. now 16 GB is for now already a working graphics on the 9070 xt, but charging 200 us dollar premium for at worst 60 us dollars more vram would invite those comparisons. if they go 500 us dollar 16 GB and 560 us dollars 32 GB (or 600 and 660), then people will literally see it as a pure vram choice for people who want more vram and a pro customer move and reviewers should praise it massively. that would also be longterm thinking to gain marketshare on a good generation to do it, instead of trying to get a bit more money out of viewer cards sold...
View on Reddit #48364915

reddit_equals_censor@reddit

wrong. clamshell. memory on both sides with the same memory bandwidth. done in the past. done on the 4060 ti 16 GB insult. so 32 GB with 256 bit bus is no problem at all with gddr6. this is also always worth keeping in mind, when a company is lying to you about "oh that is the only possible vram configruation", which is a double lie, because they decided on the memory bus and THEN on top of that decided to only put memory on one side of the card instead of both (clam shell)
View on Reddit #48359612

Healthy_BrAd6254@reddit

7600 XT runs 16GB on 128 bit This is a 256 bit GPU
View on Reddit #48357590

the_dude_that_faps@reddit

Just clamshell it. GDDR6X should be cheap these days.
View on Reddit #48357517

LuminanceGayming@reddit

could be clamshell like the 4060 TI 16GB
View on Reddit #48357410

ehxy@reddit

to be fair....I was kind of expecting a 5070RTX Ti to have AT LEAST 24GB memory but they kinda shit the bead there
View on Reddit #48356984

gatorbater5@reddit

midrange nvidia with a 'surplus' of memory? you must be new. they always have just enough for the moment.
View on Reddit #48357227

Raining_dicks@reddit

Inb4 24gb 5060 ti
View on Reddit #48358196

RealOxygen@reddit

Nvidia moved the needle for what a high end card is Current gen 80 series is typical 70 series performance for 80ti/90 series pricing
View on Reddit #48357831

Significant_L0w@reddit

they have to compete in gaming if they want to keep in track with AI race, it is actually inter connected if you think about it Besides gaming there is huge demand for high vram cards for local llm
View on Reddit #48357731

verci0222@reddit

What's the point if the performance isn't there lmao
View on Reddit #48356663

iMaexx_Backup@reddit

Not everything is about gaming lol
View on Reddit #48356975

verci0222@reddit

Creatives and those looking for local ai machines are on Nvidia so I still fail to see what's the use case for this
View on Reddit #48357260

iMaexx_Backup@reddit

Nope, not necessarily.
View on Reddit #48357948

-Purrfection-@reddit

What kind of productivity tasks can be done on AMD?
View on Reddit #48358047

shugthedug3@reddit

Just about any. Performance isn't usually the best.
View on Reddit #48360590

Strazdas1@reddit

if you need to have software acceleration fallbacks then yeah, performance will suffer. But hey if you are lucky that you can deploy on AMD and not have to spend triple the unit costs on troubleshooting, go for it.
View on Reddit #48445377

noiserr@reddit

You can run LM Studio or any other inference framework for local AI coding assistants.
View on Reddit #48362020

iMaexx_Backup@reddit

Most of them. It’d be much easier to name the handful of tasks that can’t be done with AMD.
View on Reddit #48358348

Mean-Professiontruth@reddit

Mentally gymnastics apparently
View on Reddit #48358307

PorchettaM@reddit

VRAM capacity is the single most important spec for local AI. If AMD can offer 32 GB at a price point where Nvidia is offering 16, the hobbyists will be tempted to switch, even if it means having to deal with ROCm.
View on Reddit #48358153

reddit_equals_censor@reddit

for the same hobbyists: may i interest you in a 128 GB unified memory strix halo mini pc? ;) i mean sucks, that you can't get it with ecc it seems, but yeah 128 GB with a basic high end up, when a 5090 is just 32 GB and even the 5090 workstation version would be just 96 GB... well :D smart amd marketing team (doesn't exist) would look at strix halo and at a 32 GB 9070 xt and market the shit out of vram/unified memory for gamers and professionals. show games straight up breaking down with some 8 GB mobile dedicated gpu insult and show models or 3d renders eating tons of memory, while the competition breaks or crawls to an almost hault. but that is what purely hypothetical smart amd marketing team would do among other things, so sadly it will never happen :o
View on Reddit #48360410

devinprocess@reddit

Have you seen strix halo prices for just 32gb stuff :) They make apple blush.
View on Reddit #48367165

noiserr@reddit

Strix Halo isn't going to be cheap and 9070xt will be faster.
View on Reddit #48361982

reddit_equals_censor@reddit

memory size would be the thing, that matters most for people who want to run local llms. strix halo from my understanding would be fast enough and running a model, that takes up 128 GB would be vastly more interesting. will be very interesting how much amd will charge for strix halo, because theoretically it doesn't need to be super expensive in a laptop or mini pc. THEORETICALLY. both would be great options at possibly different price points. if strix halo is priced agressively, then it could be a great deal if you buy it in a laptop, because you'd also get a great laptop to carry with you, when you aren't running an llm at home theoretically. but they will probs charge a ton. either way they should just do united market for vram to the moon options for customers with both products. it is easy to market sth, when the competition is releasing broken hardware probably again (8 GB vram at laptop for nvidia we expect)
View on Reddit #48365840

kontis@reddit

Exactly, they are on Nvidia. So how to get the OFF Nvidia? With a much higher value product per $.
View on Reddit #48361812

Jaznavav@reddit

Inferencing on AMD *works*, even if worse than Nvidia. Between a 32 and 16gb GPU for local ai the choice is obvious, you can cope with ROCM on linux, you can't cope with having half the memory.
View on Reddit #48358429

Archer_Key@reddit

it should be more powerful than the 7900xt, and yet it have less vram
View on Reddit #48358149

Krugboi@reddit

They cancelled "9080/9090 xt" models believing that they cant catchup to nvidia, but after seeing miserable %10 uplift from 5080 maybe they decided to make it again?? I hope so
View on Reddit #48356622

Strazdas1@reddit

given that we already have AMD cards on shelves, theres no way they have the time to alter the architecture now.
View on Reddit #48445104

amazingspiderlesbian@reddit

32gb of ram would be a 5090 competitor. And they would have to somehow make something like 70% faster than the 7900xtx to compete with that.
View on Reddit #48356830

Krugboi@reddit

They competed 4080 with xtx 24gb's vram, same vram doesnt mean performance segment especially when 5080 should've had 24gb
View on Reddit #48356966

amazingspiderlesbian@reddit

I mean, i guess they could make a selling point of having slow gpu with tons of vram. For like AI stuff market. But it's not really good for gamers since you're going to get a gpu that would probably cost the same or more than a 5070 ti 16gb with less performance than that with a 32gb 9070xt. And like zero actual use for it outside of machine learning
View on Reddit #48357081

Slyons89@reddit

"Gamers" in general are simple to market to, and believe bigger number better, regardless of whether they need it. It's silly, but just because it's practically useless doesn't mean people won't pay extra for it. We see this all the time with things like $500 motherboards and and 1200W power supplies.
View on Reddit #48370206

noiserr@reddit

You don't need 32GB for gaming true. But for local LLMs this would be quite sweet. For one it should be twice as fast as Nvidia's DIGITs.
View on Reddit #48363367

SubstantialInside428@reddit

Weird, I thought 9070XT was the only planned monolithic die for RDNA4 and that's the real reason 80 class got canned, because MCM is still too tricky/costly to make sense ?
View on Reddit #48357020

JakeTappersCat@reddit

Imagine how depressed 5080 buyers will be when AMD releases a 9070XT 32GB for $699 that nearly matches their card in speed and has double the VRAM. Maybe it would teach people a lesson about buying nvidia at launch when they try to rip everyone off
View on Reddit #48369716

TateEight@reddit

Yeah lol this isn’t gonna happen Shit is still gonna die as soon as your turn up RT buT i hAvE mOAr VrAM
View on Reddit #48439203

TateEight@reddit

Can smear your screen with Vaseline with dookie ass FSR too
View on Reddit #48439221

rTpure@reddit

AMD could make a card that's twice the speed at half the price, and people would wait for nvidia to lower their prices so they could buy a geforce instead
View on Reddit #48401065

Reasonable_Can_5793@reddit

I’m a big fan of this idea. It’s about time they stopped making the 16GB version the high-end model. This would definitely encourage Nvidia to make the 6080 at least 24GB of VRAM, and every mid-range card should have at least 16GB instead of 12GB. And the low-end cards should have at least 12GB.
View on Reddit #48432494

Optimal_Visual3291@reddit

Gamers don’t need 32gb. This would be aimed at “pro” type card. I’ll take my down votes.
View on Reddit #48431786

Astigi@reddit

There is no significant improved performance in RDNA 4, just another 16GB VRAM on XTX version. Nvidia has never been worried about AMD
View on Reddit #48425339

TigerBalmES@reddit

I’ll definitely buy that over ngreedia products
View on Reddit #48421681

EntertainmentMean611@reddit

Honestly... a slower core and give it 128g of vram .. even with slower vram and it would sell like nuts for local AI hosting.
View on Reddit #48399529

Gronkers@reddit

Could finally load all those skyrim mods...
View on Reddit #48394579

ea_man@reddit

This would make so much sense that AMD will never contemplate it, they would send wagons to the AI folks.
View on Reddit #48387735

Doubleyoupee@reddit

Absent 9080xt confirmed. That's why they needed some more time!
View on Reddit #48356758

SubstantialInside428@reddit

Hope not, I don't want buyer's remorse for taking the 9070XT
View on Reddit #48357086

PalpitationKooky104@reddit

more remorse for a 5080 at1300
View on Reddit #48381813

SubstantialInside428@reddit

Those buyers deserve hell
View on Reddit #48384618

funix@reddit

So we're moving into an era where our GPUs will have more RAM than our motherboards.
View on Reddit #48382161

Stennan@reddit

This doesn't sound like the worst idea AMD has come up with ("Poor Volta" would fit that description). DDR6 doesn't have that many customers, so 16GB should cost around 40-50$. Add a bit more for a sandwiched PCB, and AMD could launch a 32GB "9070" below 1000$.
View on Reddit #48356559

Jeep-Eep@reddit

We'll probably see similar with 16 gig 9060s and the like.
View on Reddit #48375092

SnowZzInJuly@reddit

And it’ll have all the same inventory issues all high end cpu parts do now. You won’t get one until 2 years later. Enjoy 😊
View on Reddit #48356194

Fer-Butterscotch@reddit

Eh, it's a bit niche. Paying more for no appreciable extra performance in most gaming scenarios seems like it'll keep demand down among people who won't legitimately utilise the extra ram.  Or sheeple will sheeple and buy it to get high huffing the spec sheet. One of the two.
View on Reddit #48356544

Jeep-Eep@reddit

I mean, I'd go out of my way to grab it because radeon double VRAM SKUs have a long history of being enduring performers.
View on Reddit #48375008

MiloIsTheBest@reddit

I would definitely huff that spec sheet.  I don't *quite* feel like 32gb is completely necessary but I certainly wanted *something* to offer more than 16gb this generation without costing $5000AUD A 24GB 5080 would've suited me just fine
View on Reddit #48357204

Turtvaiz@reddit

> A 24GB 5080 would've suited me just fine That seems rather likely to be releasing later on as a super model
View on Reddit #48373907

Fer-Butterscotch@reddit

Yeah, look, I'd buy one just to throw some weird shit at it and see how it handles it, but it wouldn't be games I don't think. I don't know too much, but my impression was that 16gb is plenty for 4k, and I don't think the compute power is there for 4k triple screen in any real way? Not sure tho, happy to be proven wrong.
View on Reddit #48358533

MiloIsTheBest@reddit

How much is plenty depends on the game and what assets it would want to load.  The more detailed and more expansive games are the more they require regardless of resolution.  I want something that will not have an issue storing assets for quite some time. And as far as I'm concerned 16GB today is 8GB 5 years ago. Maybe handles pretty much everything *now*, but this thing's gotta last me a couple of generations. For me, I like to Flight Sim (for the scenery, I'm a filthy casual) and I play a few games which, thanks to an under provisioned GPU, run afoul of VRAM long before the compute performance is too low.  For my next card, too much is far preferable to 'probably enough', and none of the regular cards have been announced with anything more than "probably enough".
View on Reddit #48359107

NGGKroze@reddit

Those while gamers try to buy them will be scalped or brough for LLM use due to high VRAM.
View on Reddit #48357251

Fer-Butterscotch@reddit

My point is that there are very very few gamers who have a reason to buy this just for gaming. And that's a good thing. It's a niche product and I'd love to see it get made.
View on Reddit #48358418

Visible_Witness_884@reddit

2 years later and at higher price than when it launched.
View on Reddit #48356644

ButtPlugForPM@reddit

Hopefully they come out swinging with UDNA give something thats like 60 percent on top of a 7900xt.. but go really off book,and do multichiplet,and give it HBM for crazy bandwidth..
View on Reddit #48373776

ga_st@reddit

Yes please. This, and unusable 150x multi frame gen just to fuck with Nvidia's "performance" graphs. Please AMD.
View on Reddit #48359103

reddit_equals_censor@reddit

i do want to see a meme marketing slide, that shows a 16x fake interpolated frame gen mode. laughing it up at nvidia's bs and then moving on to ACTUAL performance in the next slide. moore's law is dead made a lil short showing sth like that: [https://www.youtube.com/shorts/XYWPKbbwS18](https://www.youtube.com/shorts/XYWPKbbwS18) but that would be clever marketing, so we won't see that from amd ;)
View on Reddit #48360654

ga_st@reddit

I see, I don't really follow or watch anything from the guy, but I guess he making the same joke is the reason why my post is being downvoted. That, or people who really like to be shafted by alligator leather jacket man. So many on this sub. Some people just like the whip, you know? And jacked boy knows his customers very well, so whip it is. There are people ITT doing the *how can she slap face* because they can't fathom the possibility of a 70/80 class GPU having 32GB of VRAM in big 2025. Those people have been scarred by Nvidia, it's so sad to watch. In any case yea, if I were AMD I'd really do that. After that, I'd go on a divergent path, do my own thing compared to Nvidia, advance graphics, sustainably, together with the community and partners, which are plenty and influential. AMD owns the whole console and handheld market, they should really stop chasing Nvidia's shenanigans. Those saying AMD should follow Nvidia are monumental morons.
View on Reddit #48371907

Ch1kuwa@reddit

Double the VRAM, double the idle power consumption on a multi-monitor setup.
View on Reddit #48363194

Reggitor360@reddit

So like the 3090/5080/90 did since launch and no one cared about the 90-125W idle?
View on Reddit #48367699

AreYouAWiiizard@reddit

Maybe they realized they can add a heap of VRAM and dump a heap of power into the 9070XT and sell it as a 9080 for extra profit since the 5080 showed a disappointing increase?
View on Reddit #48364002

GenZia@reddit

Since I'm no AI/Llama/DeepSeek expert (to put it mildly), would anyone mind explaining exactly how reliant these LLM models are on bandwidth? If all they need is VRAM and don't depend heavily on bandwidth (or at least don’t require a ton of it), then it might not be the worst idea to explore LPDDR-X instead of the more traditional GDDR or HBM. For example, Samsung currently offers 128Gb @ 32-bit LPDDR5X running at 7.5 Gbps. That means you could have up to 128GB on a 256-bit wide bus, though the bandwidth would be roughly comparable to something like a GTX 1070 Ti (8 Gbps GDDR5X). That said, I’m not entirely sure if 128GB would be enough to run the full-blown 236B parameter model locally at 4-bit.
View on Reddit #48361162

noiserr@reddit

Bandwidth is important. 9070xt has 624.1 GB/s bandwidth which is nothing to sneeze at. It's slower than 3090, 7900xtx but not by much. To put it in perspective this is still twice as much bandwidth that Strix Halo or Nvidia's DIGITS.
View on Reddit #48363547

koolaidismything@reddit

Even low end GPUs mostly have more RAM than my whole system. I need to go look at a modern AAA game. I don’t game the last one I saw and was like woah was Arma maybe.. like 2017
View on Reddit #48362947

lordofthedrones@reddit

Compute card but gaming pricing? Please, please be real.
View on Reddit #48362160

_Restless_Spirit_@reddit

Oh my god, yes please. With local LLMs, the more VRAM the better.
View on Reddit #48357339

moschles@reddit

What software can utilize AMD GPUs to train LLMs? (Unless you meant local inference? )
View on Reddit #48361313

noiserr@reddit

You aren't going to train models on desktop GPUs. You usually need much more compute and datacenter GPUs for that. You can fine tune on Radeon using qLoRa though. And yes inference.
View on Reddit #48361815

noiserr@reddit

If true this is great news for local llama enthusiasts. Literally what I've been waiting for.
View on Reddit #48361066

noiserr@reddit

This is all I ask for. Fuck yes.
View on Reddit #48360953

PeakBrave8235@reddit

Meanwhile M4 Max is 3090 level desktop performance with 128 GB of graphics memory. 
View on Reddit #48356348

Fer-Butterscotch@reddit

What's the cheapest you can get an M4 Max?
View on Reddit #48356472

PeakBrave8235@reddit

Sounds like a rhetorical question so you should try to make your point you want to make
View on Reddit #48356522

Fer-Butterscotch@reddit

Honestly asking, I dont know Macs. There earnt many options for 32gb cards, so even a few grand is not really insane.
View on Reddit #48356587

PeakBrave8235@reddit

Oh gotcha! Well in desktop form it’s likely to start at $2,000 USD. The current M2 Max desktop model starts at $2,000 for 32 GB. 
View on Reddit #48356711

Fer-Butterscotch@reddit

OK, that's pretty impressive since I think you're currently looking at the workstation type cards which run thousands of USD to get 32 gb and decent GPU power. Good call.
View on Reddit #48356837

PeakBrave8235@reddit

I agree totally! The rumored Apple M4U chip will likely be on the level of a 4090 Desktop GPU, which is based on past Blender results and extrapolation  And it’s also rumored to have 256 GB of memory.  So you cohld have 4090 levels of power with 256 GB of graphics memory. And because the memory is unified (and not shared), it means you truly have access to all that memory for anything you want to do!
View on Reddit #48357038

Fer-Butterscotch@reddit

Freaking ay, that's what having design control of the end to end platform and a ready made consummer base will do for you!  I gotta admit, I was puzzled when Apple said they were going to bring chip making in-house, but it was a 100% genius move.
View on Reddit #48357178

PeakBrave8235@reddit

I agree! Exactly! One more thing. I’ve seen some videos of an Apple MLX Engineer (MLX and Metal are most equivalent to “CUDA”) testing those Large transformer models, and he was able to connect 3 M2U chips with 192 GB of memory each, and while running a LLM machine learning model, the whole thing only took 100 watts of energy at its peak, with the average being 60 watts for all 3 Macs together. 
View on Reddit #48357895

Fer-Butterscotch@reddit

Yeah, I'd heard about that idea of interconnecting them, but I'd forgotten about it.  Very cool, exciting times.
View on Reddit #48358374

szczszqweqwe@reddit

So a 4070s performance with lots of memory, but it's a shared memory, not GPU only memory, still at 128GB it doesn't make much difference in most cases.
View on Reddit #48356628

PeakBrave8235@reddit

It absolutely makes a difference lol. The memory isn’t “shared,” rather it’s stored in one place. Instead of one pool of memory being split between the CPU and GPU, all data is stored in the memory and can be simultaneously accessed and manipulated by both the CPU and GPU. It’s not like shared memory models such as Intel’s iGPU
View on Reddit #48356802

szczszqweqwe@reddit

You are right, I meant a single pool of memory for both without any division. I haven't got iGPU for a while and it definitely shows.
View on Reddit #48357296

PeakBrave8235@reddit

Well traditional iGPUs use shared memory, which is taking a portion of memory and allocating it. The CPU or GPU cannot directly manipulate it without first having to copy the data and send it to either (example CPU wants to manipulate data that the GPU is working on, it needs to first copy that data) Whereas the unified model is one single memory pool where both the CPU and GPU can access data without needing to copy it first. It speeds stuff up and saves on power, while also providing large amounts of memory. LPDDR has lower bandwidth than GDDR or HBM, but the benefit is 1) large amounts of it, 2) low energy costs, 3) not as expensive. 
View on Reddit #48358072

Kionera@reddit

AMD has Strix Halo with 96GB of shared memory, there are several mini desktops being made around it.
View on Reddit #48357262

PeakBrave8235@reddit

Yes that’s true, but Strix Halo doesn’t match the 3090 desktop in performance. 
View on Reddit #48357345

AutoModerator@reddit

Hello HLumin! Please **double check that this submission is original reporting and is not an unverified rumor or repost** that does not rise to the standards of /r/hardware. If this link is reporting on the work of another site/source or is an unverified rumor, please delete this submission. If this warning is in error, please report this comment and we will remove it. *I am a bot, and this action was performed automatically. Please [contact the moderators of this subreddit](/message/compose/?to=/r/hardware) if you have any questions or concerns.*
View on Reddit #48355905