If this is true, say bye bye to 599 9070XT 16GB.
599$ for 9070
799$ for 9070XT
999$ for 9070XT 32GB - you are getting 5090 VRAM, but for half the money (MSRP wise) while also being in performance between 4070Ti Super and 4080S.
Problem with with such high VRAM, those will be scalped so hard so either you won't find any or you will find them, but lot higher than MSRP.
>Problem with with such high VRAM, those will be scalped so hard so either you won't find any or you will find them, but lot higher than MSRP.
mid size dies with dirt cheap gddr6 and basic packaging. and a not new node.
amd can pump those out way beyond scalping possibility.
i mean hell they added 2 more months of production already with the delay of the cards.
there is also 0 uniqueness about the 32 GB version. same die clam shell and done, so amd could easily shift most of their production to the 32 GB version if they are vastly more in demand.
Yeah, RDNA 4 was designed with manufacturability and cost effect first in mind, in that order. They've acting like the console guys, not trying to catch nVidia this gen.
>not trying to catch nVidia this gen.
it is worth remembering, that amd has a long history of just making midrange cards every other generation even often.
so it is nothing new.
now arguably with how shit the 50 series is, the cancelled chiplet gddr7 monster high end gpu could be seen as a mistake, but who could have guessed, that the 50 series will be such a piece of shit/stand still.
and in regards to catching nvidia, if the leaks, which we got plenty by now are true, then amd is on part or ahead in architecture.
amd performance/cost of the whole card should be ahead of nvidia, as they are still using dirt cheap gddr6, instead of gddr7 even.
raytracing performance being massively up and fsr4 upscaling being a big step forward.
so architecturaly they seem to be at bare minimum on par with nvidia, which is very impressive.
and that is the hard part. making a high end version of an architecture just in general is not the hard part.
although in rdna4's specific case it would been uniquely harder, because instead of just a massively bigger monolithic die, they went insanely chiplet based with split cores (rdna3 chiplet is child's play in comparison)
there are a lot of companies you can reasonably accuse of "ai obsession", but probably not the one that is "selling the pickaxes" so to speak who currently has a license to convert all available tsmc allocation directly into money because the industry has an insatiable demand for nvidia GPUs regardless of how gamers view the latest RTX cards
People said the same thing about RDNA2, yet there was hardly any supply when NVIDIA GPUs were going for 3x MSRP. I hope they do pull it off but I’m not holding my breath.
I dont remeber the version number. It was a bit over a year ago. We ran into an issue we couldnt solve. There was no documentation about it. Contacting AMD lead to nothing (basically got told to fix it ourselves). Eventually department head decided that it would be cheaper to just switch to Nvidia instead of trying to fix ROCm.
But why a *9070*? I feel like something with that much VRAM would be reserved for like a 9090 or 9090xt, like the 7900xtx with 24GB and the 7900xt with 20GB.
This just seems really out of place?
>Frankly I'm annoyed it doesn't *already* have 24GB
i think price should define if 16 GB is fine or not for now.
a 500 euro 9070 xt with 16 GB seems fine this generation. a 600 euro one.. meh.
having a double vram edition for the memory difference cost only would make up for that though.
the rx 480 4 GB and 8 GB did that and people loved that. choice instead of scams.
Some of you are really delusional, 500 EUR would put it at 449$ MSRP tops, there's no way it will be that low, you're literally setting yourself for disappointments or just looking for excuses to buy nVIDIA.
500 euros may be 500 us dollars, based on theft and inflation theft at the time.
and worth keeping in mind, that 450 us dollars for a 9070 xt would still mean lots of profit for amd... just to keep that in mind.
>or just looking for excuses to buy nVIDIA.
that is kind of hard for me, as i won't put a fire hazard 12 pin device in my home, nor will i torture myself with nvidia's gnu + linux proprietary drivers, nor will i throw money at missing vram.
so wrong person to put that suggestion up :D
and disapointment is all but assured by amd's leadership and especially marketing team.
<points at last few releases with bad reviews, bad prices followed by price decreases shortly after, so that amd can anjoy bad reviews with acceptable prices at the same time :D
inflation is fully artificially created by physical or digital money printing, that may or may not have any backing.
as the nuked value is now in the printed money and as that printed money is used by the kakistocracy and doesn't go the public, that is indeed theft!
here is a great lil meme video with lovely music, that explains it in a very nice way ;)
[https://www.youtube.com/watch?v=WEMCYBPUR00](https://www.youtube.com/watch?v=WEMCYBPUR00)
the graph in the background is the value of the us dollar ;)
and important to understand, that inflation is NOT inherent to any money system in society.
any such idea is complete nonsense. we had tons of money systems without inflation in the past.
to name just one commonly known one, directly value backed currency or direct value in the currency.
as in gold backed currency or silver backed currency, or direct silver or gold coins with fixed weight.
so yeah the added theft in the form of "tariffs" is far from the only theft from the government.
but any more than that would e too far off from the hardware topic.
but it makes sense to know when you look at hardware prices over 2 decades with money printers going brrrrrrrrrrrrrrr very hard to devalue YOUR money.
while money printing encourages inflation, thats not its only source.
i understand your dislike for inflation, and we could discuss this somewher else as this isnt the right place if you want, but i dont agree it should be considered theft. And tariffs are only theft if you are one of those "all taxes are theft" people.
almost as if one shouldn't throw their naming over board to match the competition, that is already full of shit with their naming :D
an 8800 XTX 32 G
would for example slot in way better naming wise if they wanted to, but nah....
let's not do that :D
gotta make sure, that no one has any idea what an amd card's name means even enthusiasts, while the competition, that owns the market has scamed people with names, but kept them consistent for years and years....
uh shit!
what if amd already changes names completely before the 32 GB version of the 9070 xt comes out ;)
Remember Nvidia released the 4060ti with 16GB vram, which now is still equal to the 5080 vram. Helps throw a bone to people who don’t need top top tier gaming but need the vram at a lower cost.
Well, as far as we know a bigger die doesn't exist, it will take a lot of time to resurrect it, as they abandoned it long time ago.
If they want to release it soon-ish 9070xt and memory on a back of a PCB is probably the only way.
To all downvoters... https://videocardz.com/newz/sapphire-radeon-rx-9070-xt-nitro-pictures-reveal-hidden-16-pin-power-connector-design
Lol...or you are just Nshitia fans?
if amd aren't idiots, then they will release a 9070 xt with 32 GB, that has only the memory cost difference added to the 9070 xt price.
why? because the profit stays the same. it gives them a great vrm argument, especially if they ONLY release 16 GB cards and up this generation and it makes it look better to the 5080 insult and even the 5090, if you want a card to use for a long while or need more vram for other applications already.
that would be the smart move, that would be the "gain market share" move.
so amd marketing might fight tooth and nail from letting that happen :D
This idea, is exactly spot on. Hope greed doesn't get in the way of patience and sheer logic. If I'm even considering buying the 32gb version after a solid no to the 16gb version due to pcie 4.0 rather than pcie 5.0 That! Is exactly what would do it for me and possibly countless others.
I was going to get the 16gb version until rumors came out it was only pcie 4.0, that is what changed my mind. 32gb made me possibly think twice about waiting for pcie 5.0
I believe, they would want as many people to be happy as possible. No, it will not cut the amount of devs, yes, it would indeed bring more people to AMD. If the new 9000 series is indeed pcie 4.0 and 32gb vram I am now tempted to get one even though I was waiting for pcie 5.0 simply because of how much Vram is stated as being placed within the card. No, developers will always choose what works best for them either NVIDIA or AMD or Intel no matter what gamers get it would only help the developers bring in more cash as it would open the door for one reason more a game should cost more. They aren't going to care if consumers have a card as powerful as theirs that just means a finer product. Inflation will indeed happen eventually, hopefully so will paychecks, but without some reason to build a game we will never have them as without this new ram capacity there would be no need.
P.s. while the developer market is needed.. having a program for seasoned developers to receive a discount or free cards seeing that development is like what? No more than 1-10% of the market share (I could be wrong I'm guessing) but that way through a special program they would get a free card so long as they optimized their games for AMD would be a wise move. Also, if you like the company you buy video cards from solely for the sake of buying from that company, just beware of burning cables (speaking about 12VHPWR). So have fun and beware of all the ups and downs of every company.
Its almost certainly the Radeon Pro Line which has 2x memory with double sided vram
Here is the W7900 (70990xtx) with 48 GB
[https://www.amd.com/en/products/graphics/workstations/radeon-pro/w7900.html](https://www.amd.com/en/products/graphics/workstations/radeon-pro/w7900.html)
Hard disagree. You can buy a 16GB 7600xt for $300. The cost of doubling the VRAM to 32GB can't be that much more than the cost of the entire card. Maybe I'm not factoring in bus width, or different, more dense chips, but my kids old 3060 has 12GB.
For AI, and rendering tasks, just having enough RAM is the most important barrier, how fast it works after that is important but not like that first hurdle. For what I do, CFD can use two GPUs (FluidX3D with OpenCL), and Ollama for AI, BOTH can use more than one GPU and it's additive. So getting two GPUs that total 64GB for around $2k would change everything.
This is not a gaming card.
Or else they could have have launched this too with Gaming rdna3 32 GB but didn't
The 32 GB based on rdna3 exists btw
https://www.amd.com/en/products/graphics/workstations/radeon-pro/w7800.html
24GB exists because of the 384 bit bus 384 bit bus allowed for the 48 GB pro card.
Nvidia does the same. 24 GB of the 4090 and the L40/RTX 6000 Ada has 48
The 5090 will get a pro card 64 GB and possiblly 96 GB variant.
The 5080 will get turned into the 32 GB pro card.
Pro cards will get the full future set including fully unlocked fp16 perf (on atleast Nvidia cards) with additional software and enterprise support.
And why would you assume they will do what they have never done, using knowledge that they have always had pro cards with double sided VRAM?
Why would they cannibalise their pro lineup?
32GB Navi48 would not have a different bus width or denser chips, but just clamshell ram.
The spot price for 16Gbit GDDR6 modules is ~$8, add some premium for the top speed grade and a bit more for a more expensive PCB, the cost add to AMD would certainly be below $80.
Yep.
I made no claim on the price difference between the card with doubled ram config and base, only clarified the cost difference.
AMD always gets much higher margins from Pro-grade stuff.
May be a second Radeon VII, failed pro cards?
[https://www.techpowerup.com/gpu-specs/radeon-vii.c3358](https://www.techpowerup.com/gpu-specs/radeon-vii.c3358)
Always wanted one, just there failure rates seemed a tad high.
I would've bought one if they made sense vs the competition. But then again, I had a 1080ti back then. It would've been a sidegrade and the crypto boom made them unubtanium.
I'm all for it, and would pay extra as long as the extra was reasonable. But it's AMD, if it makes sense for consumers, they probably won't do it.
They're not the same company that brought us the first few Ryzen CPUs.
Considering the state of ROCm, sacrificing some of their Pro margins in the short term to bring more devs into their ecosystem would probably be a smart move. Which means AMD is unlikely to do it.
It would be a bold and desperately needed move by AMD. It would be incredible if true.
This would have the greatest chance of destroying CUDA moat. Tons of GitHub projects would try to support this card, by pushing for agnostic alternatives like TinyGrad.
Heck, I'm in the "CUDA is required for my work" camp and I would buy this card in a heartbeat.
I doubt it, the reason being... simple AMD has a track record of picking the ball up and dropping it with in 1-2 generations. There going to need to prove there reliable first. Otherwise why put in the work and effort.
Nividia if nothing else is reliable you know there going to continue down the same path and innovate. Where AMD might have a brain fart decide to go branch off in a completely unrelated direction while completely dropping what they were doing and ignoring it.
It was unified back when OpenCL and co. were the focus, that just fell apart and not maintained as the focus changed. Also about the same time AMD nearly went bankrupt, if we're looking at reasons *why* that might be :P
It depends on the workload I suppose. AI stuff apparently only needs capacity rather than bandwidth so if that is the target market maybe it will be fine.
But with GDDR6 the 9070s will have 640GB/s of memory bandwidth, compare that to the 5090's *1792*GB/s with GDDR7 and double the memory bus width. This is only part of the equation for a card's performance, but games won't benefit much from just doubling the RAM on a card like this.
If the model you want to run doesn't fit in VRAM then bandwidth don't mean shit as you are stuck using the CPU and performance will be awful. More VRAM is always better for AI workloads.
Yes, you'd need multiple cards whatever you buy, so you'd want the cheapest vram/price ratio card that sums up to R1's total size within whatever PCIe slots you can get (mining cards got to 8 slots I think?)
Those have 8GB less VRAM, and the 4090 costs way more. The 9070XT should also outperform the 7900XTX in AI tasks based on all the AI upgrades to Radeon driver features for RDNA4.
It will no doubt "support" ROCm, but to what extent is entirely up to AMD's dev team. If they put even half the effort into the GPU software stack that nVidia does, they would be far more competitive.
Only for some spesific things, every new open source tool and library you see coming out is build for cuda but if you are just infering a regular LLM then it should work.
Been using my 7900xtx for LLMs for over a year. And it has been smooth sailing for me. I would definitely get two of these GPUs if they were priced for consumers as vRAM capacity is a major bottleneck.
A 9070xt with 32gb VRAM is a good idea. It caters to the crowd who needs it for work, as well as enthusiasts who want to pay more for the "best" card. They can probably earn more % profit since it doesn't need to be great value like the regular 9070xt.
A "gaming" 9070xt with 32gb VRAM is stupid as heck. Not even Nvidia can put 32gb on a card and call it "gaming"
Entirely unnecessary for a mid-range card, but there might be some buyers. If you actually need 32GB you'd want a high core count to go along with it... and most people would probably want CUDA to go along with it too.
It is alleged that when they realised that the 9070 was closer than they expected to the 5080 they pivoted to a 9080 32gb idea that could be just a few % above the 5080
Could just be one that they want to market to people looking to run LLMs locally too, double the memory is super useful for getting the higher parameter models to run.
I'm going to take these with a pinch of salt.
Why are we hearing that AMD aren't going to compete in the high end market, yet still hundreds of rumours of high end cards?
My question is, why a 9070 with that much ram specially? I’d expect that amount of VRAM for something like a 9090 or a 9090xt, which we clearly aren’t getting. At least not anytime soon.
If someone is making cinematics/product videos in UE5 editor or doing some AI this would be an amazing card.
Complex CG scenes (no game-like optimization) in UE5 editor can even crash a 24 GB card. AI models don't even start below VRAM requirement.
You still need to be able to interface all the chips with the soc. That alone means a much larger soc for the extra pinouts.
So they would have a whole separate chip for this one card that costs them a whole lot more and they get fewer chips per wafer.
I don't buy it. If they did this it would have to be a more performance chip to make it worthwhile. It would be a 9080 or 9090 not a 9070.
Not when they put half of memory on a back of PCB.
4060ti and 4060ti 16GB have a die with exactly the same area: 188mm^(2)
[https://www.techpowerup.com/gpu-specs/geforce-rtx-4060-ti-8-gb.c3890](https://www.techpowerup.com/gpu-specs/geforce-rtx-4060-ti-8-gb.c3890)
[https://www.techpowerup.com/gpu-specs/geforce-rtx-4060-ti-16-gb.c4155](https://www.techpowerup.com/gpu-specs/geforce-rtx-4060-ti-16-gb.c4155)
The RX 7600 XT vs RX 7600 did the exact same thing from AMD too. Tho the XT was an OC as well so that's how they slapped the XT on.
It'd make sense for AMD to clamshell the 9070 XT and make it 32GB. I'd love the 9070 XTX name for funises but apparently just 32GB, fine it is accurate. Some people wanted more than 16GB so it'll find a market even if 700USD. I mean it would be 32GB on the "cheap" since the RTX 5090 is the other 32GB GPU for well over double the price. Near triple probably.
No it's called clamshell mode where you have two 32-bit memory chips sharing the same 32-bit memory controller.
You don't need a new chip with a larger memory bus. You get more capacity at the same bandwidth with this configuration.
Fair enough, didn't see the update that it's just a 9070 xt.
But I still don't believe it.
There's no 4gb chips so they'd have to have a large enough soc to be able to pin out double the amount of chips and have its own board design with half the chips on the back of the board giving it a 512bit bus.
If they said 24gb variant, yeah. But a 32gb variant could only be a professional card.
Yeah... doesn't seem like the greatest product for gamers. I guess we'll have to wait till next gen from both nVidia and AMD to upgrade if we are already anywhere north of a 7800XT / 4070 in performance.
should be a perfectly great product for gamers if amd just ads the vram price difference on top of the 9070 xt 16 GB version.
assuming the 9070 xt 16 GB is priced aggressively, a 32 GB version would be great value short and especially longterm.
16 GB of more gddr6, that is in the worst case 60 us dollars more. probably a bunch less for amd.
if amd charges 500 us dollars or 600 us dollars for the 9070 xt 16 GB and then charges 660 us dollars or 560 us dollars for the 32 GB version, people would buy them without question.
just like they bought the 8 GB version of the polaris 10 cards (rx 480 for example)
32 gigs is just a bit useless on a semi-midrange card if it comes with a price premium.
Like, if they made a 32gig 5070 - it'd just cost more and not perform better.
just give people the choice and let the market decide.
i want my next card to have at least 32 GB vram.
they can have the market decide and sell more graphics cards like they did with the 4 and 8 GB polaris 10 cards.
>if they made a 32gig 5070
the 5070 with 18 or 24 GB would perform better though, because it has only 12 GB vram and with raytracing you can get into scenarios where 12 GB vram becomes an issue. nvidia even straight up told hardware unboxed, that yeah 12 GB vram won't be enough in some scenarios.
nvidia sees the 5070 as the card to NOT get. the compromised card. DELIBERATELY COMPROMISED.
so yeah a 24 GB (clam shell) or an 18 GB (1.5 x capacity one sided) OR a 36 GB version would perform better than the 12 GB insult already in some scenarios according to nvidia themselves when talking to reviewers...
and if you automatically thought, that SURELY the 5070 has 16 GB and not 12 GB, it shows what a middle finger the 12 GB on that card is yet again.
and people who might buy a 9070 xt with 32 GB might use their card or be able to use it for the entire ps6 generation so far into the future.
yes lowered other settings, but max texture quality and still working just fine.
maybe the ps6 will lead to 24 GB vram used at high quality settings, so bye bye 16 GB vram at high quality settings by then.
and cards are used for that long if they still work. people have used (some are still using) polaris 10 cards with 8 GB vram. 8 GB vram is already a major issue, but the 4 GB version said good bye way way way way before the 8 GB vram versions.
btw worth keeping in mind, that both nvidia and amd could leave it up to partners to configure the memory and let the market decide.
partner putting 16 GB on a 4060 non ti for example would vastly outself the 8 GB insult if the difference in price would be just the 8 GB added memory.
24 GB or 18 GB 5070 would vastly outsell the 12 GB insult we can expect.
especially nvidia does NOT want customers to have the choice on vram.
it is NOT a hardware limitation at all even at a bs tiny bus. they just want to scam people and massively upsell people and have planned obsolescence.
people on 12 GB vram will probably be heavily inclined to upgrade once the ps6 comes out and the first ps6 only games come out.
while 18 or 24 GB version people would sit on it for 2 more years possibly as it still runs all games decently fine, if it would exist...
The increase in VRAM over what is used makes no performance gain. There might be a slight gain from having a higher bus width. But if there's no use in it, it doesn't matter.
>There might be a slight gain from having a higher bus width.
the examples mentioned had no bandwidth increases. 1.5 x capacity or clam shell, or 1.5x capacity memory chips + clam shell all have the same memory bandwidth and bus.
and again nvidia straight up told told tech reviewers, that yeah the 12 GB card will already have issues at launch in some games and that it is the compromised version.
"why aren't you getting the working 16 GB 5070 ti, you poor pleb" is the nvidia mentality.
and a disgusting mentality at that.
so 16 GB over 12 GB matters according to nvidia!!! even.
Well, yes, if you want to play all the games with full RT and at max quality. There are loads of games where it won't matter. But I totally agree that nvidia are putting out cards with too low vram for the performance potential.
But that's the thing - having absurd VRAM without the performance potential to utilize it? That's just waste of resources.
They could easily add $200 premium and it would still sell like hot cakes even if it's much slower than 5080. There is no other product like this on the market. Even for a gamer it's super cool to have so much headroom in VRAM.
they probably could, but i'd argue they shouldn't.
they claim to want to gain marketshare that is a great way, while keeping profits the same as the 16 GB card, which will still be great profits at a 500 us dollar price point for the 9070 xt 16 GB.
also would be a way to get a ton of gamers away from nvidia.
a bit slower than a 5080 for vastly less money with double the vram. that is a deal to take.
that is an exciting product.
that is sth, that gets people to switch to amd, especially those, who got burned by an 8 GB 3070 ti already for example.
and charging a 200 euro/us dollar premium for 60 us dollar (they pay probs a bunch less) for the added 16 GB vram would feel disgusting and probs get pooped on reviewers as well.
people would rightfully compare it to the 4060 ti 16 GB scam pricing, that charged 100 us dollars more just to add 8 GB vram and make it a working card.
now 16 GB is for now already a working graphics on the 9070 xt, but charging 200 us dollar premium for at worst 60 us dollars more vram would invite those comparisons.
if they go 500 us dollar 16 GB and 560 us dollars 32 GB (or 600 and 660), then people will literally see it as a pure vram choice for people who want more vram and a pro customer move and reviewers should praise it massively.
that would also be longterm thinking to gain marketshare on a good generation to do it, instead of trying to get a bit more money out of viewer cards sold...
wrong.
clamshell. memory on both sides with the same memory bandwidth.
done in the past. done on the 4060 ti 16 GB insult.
so 32 GB with 256 bit bus is no problem at all with gddr6.
this is also always worth keeping in mind, when a company is lying to you about "oh that is the only possible vram configruation", which is a double lie, because they decided on the memory bus and THEN on top of that decided to only put memory on one side of the card instead of both (clam shell)
they have to compete in gaming if they want to keep in track with AI race, it is actually inter connected if you think about it
Besides gaming there is huge demand for high vram cards for local llm
if you need to have software acceleration fallbacks then yeah, performance will suffer. But hey if you are lucky that you can deploy on AMD and not have to spend triple the unit costs on troubleshooting, go for it.
VRAM capacity is the single most important spec for local AI. If AMD can offer 32 GB at a price point where Nvidia is offering 16, the hobbyists will be tempted to switch, even if it means having to deal with ROCm.
for the same hobbyists:
may i interest you in a 128 GB unified memory strix halo mini pc? ;)
i mean sucks, that you can't get it with ecc it seems, but yeah 128 GB with a basic high end up, when a 5090 is just 32 GB and even the 5090 workstation version would be just 96 GB... well :D
smart amd marketing team (doesn't exist) would look at strix halo and at a 32 GB 9070 xt and market the shit out of vram/unified memory for gamers and professionals.
show games straight up breaking down with some 8 GB mobile dedicated gpu insult and show models or 3d renders eating tons of memory, while the competition breaks or crawls to an almost hault.
but that is what purely hypothetical smart amd marketing team would do among other things, so sadly it will never happen :o
memory size would be the thing, that matters most for people who want to run local llms.
strix halo from my understanding would be fast enough and running a model, that takes up 128 GB would be vastly more interesting.
will be very interesting how much amd will charge for strix halo, because theoretically it doesn't need to be super expensive in a laptop or mini pc. THEORETICALLY.
both would be great options at possibly different price points.
if strix halo is priced agressively, then it could be a great deal if you buy it in a laptop, because you'd also get a great laptop to carry with you, when you aren't running an llm at home theoretically.
but they will probs charge a ton.
either way they should just do united market for vram to the moon options for customers with both products.
it is easy to market sth, when the competition is releasing broken hardware probably again (8 GB vram at laptop for nvidia we expect)
Inferencing on AMD *works*, even if worse than Nvidia. Between a 32 and 16gb GPU for local ai the choice is obvious, you can cope with ROCM on linux, you can't cope with having half the memory.
They cancelled "9080/9090 xt" models believing that they cant catchup to nvidia, but after seeing miserable %10 uplift from 5080 maybe they decided to make it again?? I hope so
I mean, i guess they could make a selling point of having slow gpu with tons of vram. For like AI stuff market.
But it's not really good for gamers since you're going to get a gpu that would probably cost the same or more than a 5070 ti 16gb with less performance than that with a 32gb 9070xt. And like zero actual use for it outside of machine learning
"Gamers" in general are simple to market to, and believe bigger number better, regardless of whether they need it.
It's silly, but just because it's practically useless doesn't mean people won't pay extra for it. We see this all the time with things like $500 motherboards and and 1200W power supplies.
Weird, I thought 9070XT was the only planned monolithic die for RDNA4 and that's the real reason 80 class got canned, because MCM is still too tricky/costly to make sense ?
Imagine how depressed 5080 buyers will be when AMD releases a 9070XT 32GB for $699 that nearly matches their card in speed and has double the VRAM. Maybe it would teach people a lesson about buying nvidia at launch when they try to rip everyone off
AMD could make a card that's twice the speed at half the price, and people would wait for nvidia to lower their prices so they could buy a geforce instead
I’m a big fan of this idea. It’s about time they stopped making the 16GB version the high-end model. This would definitely encourage Nvidia to make the 6080 at least 24GB of VRAM, and every mid-range card should have at least 16GB instead of 12GB. And the low-end cards should have at least 12GB.
This doesn't sound like the worst idea AMD has come up with ("Poor Volta" would fit that description).
DDR6 doesn't have that many customers, so 16GB should cost around 40-50$. Add a bit more for a sandwiched PCB, and AMD could launch a 32GB "9070" below 1000$.
Eh, it's a bit niche. Paying more for no appreciable extra performance in most gaming scenarios seems like it'll keep demand down among people who won't legitimately utilise the extra ram.
Or sheeple will sheeple and buy it to get high huffing the spec sheet. One of the two.
I would definitely huff that spec sheet.
I don't *quite* feel like 32gb is completely necessary but I certainly wanted *something* to offer more than 16gb this generation without costing $5000AUD
A 24GB 5080 would've suited me just fine
Yeah, look, I'd buy one just to throw some weird shit at it and see how it handles it, but it wouldn't be games I don't think. I don't know too much, but my impression was that 16gb is plenty for 4k, and I don't think the compute power is there for 4k triple screen in any real way? Not sure tho, happy to be proven wrong.
How much is plenty depends on the game and what assets it would want to load.
The more detailed and more expansive games are the more they require regardless of resolution.
I want something that will not have an issue storing assets for quite some time. And as far as I'm concerned 16GB today is 8GB 5 years ago. Maybe handles pretty much everything *now*, but this thing's gotta last me a couple of generations.
For me, I like to Flight Sim (for the scenery, I'm a filthy casual) and I play a few games which, thanks to an under provisioned GPU, run afoul of VRAM long before the compute performance is too low.
For my next card, too much is far preferable to 'probably enough', and none of the regular cards have been announced with anything more than "probably enough".
My point is that there are very very few gamers who have a reason to buy this just for gaming. And that's a good thing. It's a niche product and I'd love to see it get made.
Hopefully they come out swinging with UDNA
give something thats like 60 percent on top of a 7900xt..
but go really off book,and do multichiplet,and give it HBM for crazy bandwidth..
i do want to see a meme marketing slide, that shows a 16x fake interpolated frame gen mode.
laughing it up at nvidia's bs and then moving on to ACTUAL performance in the next slide.
moore's law is dead made a lil short showing sth like that:
[https://www.youtube.com/shorts/XYWPKbbwS18](https://www.youtube.com/shorts/XYWPKbbwS18)
but that would be clever marketing, so we won't see that from amd ;)
I see, I don't really follow or watch anything from the guy, but I guess he making the same joke is the reason why my post is being downvoted.
That, or people who really like to be shafted by alligator leather jacket man. So many on this sub. Some people just like the whip, you know? And jacked boy knows his customers very well, so whip it is.
There are people ITT doing the *how can she slap face* because they can't fathom the possibility of a 70/80 class GPU having 32GB of VRAM in big 2025. Those people have been scarred by Nvidia, it's so sad to watch.
In any case yea, if I were AMD I'd really do that. After that, I'd go on a divergent path, do my own thing compared to Nvidia, advance graphics, sustainably, together with the community and partners, which are plenty and influential. AMD owns the whole console and handheld market, they should really stop chasing Nvidia's shenanigans. Those saying AMD should follow Nvidia are monumental morons.
Maybe they realized they can add a heap of VRAM and dump a heap of power into the 9070XT and sell it as a 9080 for extra profit since the 5080 showed a disappointing increase?
Since I'm no AI/Llama/DeepSeek expert (to put it mildly), would anyone mind explaining exactly how reliant these LLM models are on bandwidth?
If all they need is VRAM and don't depend heavily on bandwidth (or at least don’t require a ton of it), then it might not be the worst idea to explore LPDDR-X instead of the more traditional GDDR or HBM.
For example, Samsung currently offers 128Gb @ 32-bit LPDDR5X running at 7.5 Gbps. That means you could have up to 128GB on a 256-bit wide bus, though the bandwidth would be roughly comparable to something like a GTX 1070 Ti (8 Gbps GDDR5X).
That said, I’m not entirely sure if 128GB would be enough to run the full-blown 236B parameter model locally at 4-bit.
Bandwidth is important. 9070xt has 624.1 GB/s bandwidth which is nothing to sneeze at. It's slower than 3090, 7900xtx but not by much.
To put it in perspective this is still twice as much bandwidth that Strix Halo or Nvidia's DIGITS.
Even low end GPUs mostly have more RAM than my whole system. I need to go look at a modern AAA game. I don’t game the last one I saw and was like woah was Arma maybe.. like 2017
You aren't going to train models on desktop GPUs. You usually need much more compute and datacenter GPUs for that.
You can fine tune on Radeon using qLoRa though. And yes inference.
OK, that's pretty impressive since I think you're currently looking at the workstation type cards which run thousands of USD to get 32 gb and decent GPU power. Good call.
I agree totally!
The rumored Apple M4U chip will likely be on the level of a 4090 Desktop GPU, which is based on past Blender results and extrapolation
And it’s also rumored to have 256 GB of memory.
So you cohld have 4090 levels of power with 256 GB of graphics memory. And because the memory is unified (and not shared), it means you truly have access to all that memory for anything you want to do!
Freaking ay, that's what having design control of the end to end platform and a ready made consummer base will do for you!
I gotta admit, I was puzzled when Apple said they were going to bring chip making in-house, but it was a 100% genius move.
I agree! Exactly!
One more thing. I’ve seen some videos of an Apple MLX Engineer (MLX and Metal are most equivalent to “CUDA”) testing those Large transformer models, and he was able to connect 3 M2U chips with 192 GB of memory each, and while running a LLM machine learning model, the whole thing only took 100 watts of energy at its peak, with the average being 60 watts for all 3 Macs together.
So a 4070s performance with lots of memory, but it's a shared memory, not GPU only memory, still at 128GB it doesn't make much difference in most cases.
It absolutely makes a difference lol. The memory isn’t “shared,” rather it’s stored in one place. Instead of one pool of memory being split between the CPU and GPU, all data is stored in the memory and can be simultaneously accessed and manipulated by both the CPU and GPU. It’s not like shared memory models such as Intel’s iGPU
Well traditional iGPUs use shared memory, which is taking a portion of memory and allocating it. The CPU or GPU cannot directly manipulate it without first having to copy the data and send it to either (example CPU wants to manipulate data that the GPU is working on, it needs to first copy that data)
Whereas the unified model is one single memory pool where both the CPU and GPU can access data without needing to copy it first. It speeds stuff up and saves on power, while also providing large amounts of memory. LPDDR has lower bandwidth than GDDR or HBM, but the benefit is 1) large amounts of it, 2) low energy costs, 3) not as expensive.
Hello HLumin! Please **double check that this submission is original reporting and is not an unverified rumor or repost** that does not rise to the standards of /r/hardware. If this link is reporting on the work of another site/source or is an unverified rumor, please delete this submission. If this warning is in error, please report this comment and we will remove it.
*I am a bot, and this action was performed automatically. Please [contact the moderators of this subreddit](/message/compose/?to=/r/hardware) if you have any questions or concerns.*
261 Comments
NGGKroze@reddit
reddit_equals_censor@reddit
Jeep-Eep@reddit
reddit_equals_censor@reddit
Jeep-Eep@reddit
Mnmemx@reddit
Jeep-Eep@reddit
pyr0kid@reddit
Jeep-Eep@reddit
Morningst4r@reddit
noiserr@reddit
PalpitationKooky104@reddit
NeroClaudius199907@reddit
moschles@reddit
noiserr@reddit
Strazdas1@reddit
noiserr@reddit
Strazdas1@reddit
moschles@reddit
noiserr@reddit
szczszqweqwe@reddit
Aquaticle000@reddit
MiloIsTheBest@reddit
reddit_equals_censor@reddit
Merdiso@reddit
reddit_equals_censor@reddit
Strazdas1@reddit
reddit_equals_censor@reddit
Strazdas1@reddit
Strazdas1@reddit
Merdiso@reddit
Strazdas1@reddit
noiserr@reddit
reddit_equals_censor@reddit
RedPanda888@reddit
Jaznavav@reddit
szczszqweqwe@reddit
SubstantialInside428@reddit
Jeep-Eep@reddit
FormalIllustrator5@reddit
FormalIllustrator5@reddit
Reggitor360@reddit
reddit_equals_censor@reddit
Ascender4ever@reddit
Plank_With_A_Nail_In@reddit
Ascender4ever@reddit
Ascender4ever@reddit
Ascender4ever@reddit
From-UoM@reddit
mckirkus@reddit
From-UoM@reddit
mckirkus@reddit
From-UoM@reddit
HilLiedTroopsDied@reddit
ResponsibleJudge3172@reddit
PalpitationKooky104@reddit
mckirkus@reddit
Vb_33@reddit
Tuna-Fish2@reddit
PalpitationKooky104@reddit
ResponsibleJudge3172@reddit
Tuna-Fish2@reddit
JakeTappersCat@reddit
Strazdas1@reddit
the_dude_that_faps@reddit
liaminwales@reddit
the_dude_that_faps@reddit
HilLiedTroopsDied@reddit
the_dude_that_faps@reddit
PorchettaM@reddit
kontis@reddit
HotRoderX@reddit
Extra-Advisor7354@reddit
Banned_Dont_Care@reddit
topazsparrow@reddit
Apprehensive-Buy3340@reddit
ProperCollar-@reddit
BiomassDenial@reddit
Jonny_H@reddit
acc_agg@reddit
PalpitationKooky104@reddit
ExtendedDeadline@reddit
lordofthedrones@reddit
Gansaru87@reddit
simplyh@reddit
Typical-Common3955@reddit
AssholeFramed@reddit
OafishWither66@reddit
FuturePastNow@reddit
pmth@reddit
FuturePastNow@reddit
Strazdas1@reddit
Plank_With_A_Nail_In@reddit
Plank_With_A_Nail_In@reddit
Strazdas1@reddit
Kionera@reddit
Tiny-Independent273@reddit
Jaznavav@reddit
Strazdas1@reddit
Terrh@reddit
Jaznavav@reddit
Terrh@reddit
bill_cipher1996@reddit
Mean-Professiontruth@reddit
F9-0021@reddit
ea_man@reddit
callanrocks@reddit
Apprehensive-Buy3340@reddit
gurugabrielpradipaka@reddit
Kionera@reddit
sheldonrong@reddit
PalpitationKooky104@reddit
NeroClaudius199907@reddit
noiserr@reddit
Kionera@reddit
SomniumOv@reddit
iamthewhatt@reddit
Strazdas1@reddit
FinBenton@reddit
noiserr@reddit
sandor2@reddit
Kryohi@reddit
GenericUser1983@reddit
PalpitationKooky104@reddit
mokkat@reddit
Farren246@reddit
HLumin@reddit (OP)
OafishWither66@reddit
iothomas@reddit
Jaegs@reddit
animeman59@reddit
Krotiuz@reddit
hitsujiTMO@reddit
Aquaticle000@reddit
Strazdas1@reddit
Aquaticle000@reddit
szczszqweqwe@reddit
Aquaticle000@reddit
kontis@reddit
Aquaticle000@reddit
Jeep-Eep@reddit
the_dude_that_faps@reddit
TheAgentOfTheNine@reddit
szczszqweqwe@reddit
hitsujiTMO@reddit
Strazdas1@reddit
szczszqweqwe@reddit
Hayden247@reddit
szczszqweqwe@reddit
Vushivushi@reddit
Numerlor@reddit
Jeep-Eep@reddit
tobimai@reddit
moschles@reddit
Visible_Witness_884@reddit
hitsujiTMO@reddit
Visible_Witness_884@reddit
reddit_equals_censor@reddit
Visible_Witness_884@reddit
reddit_equals_censor@reddit
Visible_Witness_884@reddit
reddit_equals_censor@reddit
Visible_Witness_884@reddit
kontis@reddit
reddit_equals_censor@reddit
reddit_equals_censor@reddit
Healthy_BrAd6254@reddit
the_dude_that_faps@reddit
LuminanceGayming@reddit
ehxy@reddit
gatorbater5@reddit
Raining_dicks@reddit
RealOxygen@reddit
Significant_L0w@reddit
verci0222@reddit
iMaexx_Backup@reddit
verci0222@reddit
iMaexx_Backup@reddit
-Purrfection-@reddit
shugthedug3@reddit
Strazdas1@reddit
noiserr@reddit
iMaexx_Backup@reddit
Mean-Professiontruth@reddit
PorchettaM@reddit
reddit_equals_censor@reddit
devinprocess@reddit
noiserr@reddit
reddit_equals_censor@reddit
kontis@reddit
Jaznavav@reddit
Archer_Key@reddit
Krugboi@reddit
Strazdas1@reddit
amazingspiderlesbian@reddit
Krugboi@reddit
amazingspiderlesbian@reddit
Slyons89@reddit
noiserr@reddit
SubstantialInside428@reddit
JakeTappersCat@reddit
TateEight@reddit
TateEight@reddit
rTpure@reddit
Reasonable_Can_5793@reddit
Optimal_Visual3291@reddit
Astigi@reddit
TigerBalmES@reddit
EntertainmentMean611@reddit
Gronkers@reddit
ea_man@reddit
Doubleyoupee@reddit
SubstantialInside428@reddit
PalpitationKooky104@reddit
SubstantialInside428@reddit
funix@reddit
Stennan@reddit
Jeep-Eep@reddit
SnowZzInJuly@reddit
Fer-Butterscotch@reddit
Jeep-Eep@reddit
MiloIsTheBest@reddit
Turtvaiz@reddit
Fer-Butterscotch@reddit
MiloIsTheBest@reddit
NGGKroze@reddit
Fer-Butterscotch@reddit
Visible_Witness_884@reddit
ButtPlugForPM@reddit
ga_st@reddit
reddit_equals_censor@reddit
ga_st@reddit
Ch1kuwa@reddit
Reggitor360@reddit
AreYouAWiiizard@reddit
GenZia@reddit
noiserr@reddit
koolaidismything@reddit
lordofthedrones@reddit
_Restless_Spirit_@reddit
moschles@reddit
noiserr@reddit
noiserr@reddit
noiserr@reddit
PeakBrave8235@reddit
Fer-Butterscotch@reddit
PeakBrave8235@reddit
Fer-Butterscotch@reddit
PeakBrave8235@reddit
Fer-Butterscotch@reddit
PeakBrave8235@reddit
Fer-Butterscotch@reddit
PeakBrave8235@reddit
Fer-Butterscotch@reddit
szczszqweqwe@reddit
PeakBrave8235@reddit
szczszqweqwe@reddit
PeakBrave8235@reddit
Kionera@reddit
PeakBrave8235@reddit
AutoModerator@reddit