RTX 5090 vs M5 Ultra: Analyzing the "2.7x Faster" claim and what Nvidia didn't show you.

Posted by Major_Commercial4253@reddit | LocalLLaMA | 8 comments

Hey everyone,

I recently came across Nvidia's official graphic comparing the RTX 5090 to the M3 Ultra, claiming a "2.7x speed advantage" in local AI tasks (LLMs). As a developer, it felt a bit like comparing apples to oranges, especially with the M5 Ultra just around the corner at WWDC '26.

I did a deep dive into the architecture, memory bandwidth, and what happens when you try to run a 70B+ model on a 32GB VRAM card vs. Apple's Unified Memory.

A few key takeaways from my analysis:

The "2.7x" gap is mostly due to memory bandwidth, which the M5 Ultra is expected to bridge significantly (1.1 - 1.2 TB/s).
The RTX 5090 hits a wall with 70B+ models, which don't fit in its 32GB of VRAM, while the M5 Ultra is expected to hold them entirely in unified memory.
The power efficiency gap is still insane (575W vs ~100W).
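
A rough way to sanity-check the bandwidth and VRAM points above is a back-of-the-envelope calculation. The sketch below assumes that single-stream decoding is memory-bound, so every generated token has to stream all the weights once and tokens/s cannot exceed bandwidth divided by weight size. The 1792 GB/s and 819 GB/s figures are spec-sheet numbers as I understand them, 1150 GB/s is just the midpoint of the rumored 1.1 - 1.2 TB/s range, and the 2 GB overhead for KV cache and runtime is a made-up placeholder. All function names are hypothetical. Treat it as an estimate, not a benchmark.

```python
def weight_footprint_gb(params_billion, bits_per_weight):
    """Approximate size of the weights alone, in GB (1 GB = 1e9 bytes)."""
    return params_billion * bits_per_weight / 8


def fits_in_memory(params_billion, bits_per_weight, memory_gb, overhead_gb=2.0):
    """True if weights plus an assumed overhead (KV cache, runtime) fit."""
    needed = weight_footprint_gb(params_billion, bits_per_weight) + overhead_gb
    return needed <= memory_gb


def decode_ceiling_tok_s(bandwidth_gb_s, params_billion, bits_per_weight):
    """Upper bound on tokens/s if each token streams all weights once."""
    return bandwidth_gb_s / weight_footprint_gb(params_billion, bits_per_weight)


# Assumed bandwidth figures in GB/s; verify against the spec sheets.
BANDWIDTH_GB_S = {
    "RTX 5090": 1792,
    "M3 Ultra": 819,
    "M5 Ultra (rumored midpoint)": 1150,
}

RTX_5090_VRAM_GB = 32  # from the post

for params in (31, 70):
    size = weight_footprint_gb(params, 4)
    ok = fits_in_memory(params, 4, RTX_5090_VRAM_GB)
    print(f"{params}B @ 4-bit: {size:.1f} GB of weights, fits in 32 GB VRAM: {ok}")
    for name, bw in BANDWIDTH_GB_S.items():
        ceiling = decode_ceiling_tok_s(bw, params, 4)
        print(f"  {name}: at most ~{ceiling:.0f} tok/s (bandwidth ceiling only)")
```

With these assumed numbers the raw bandwidth ratio between the 5090 and the M3 Ultra is about 2.2x, and a 70B model at 4-bit needs roughly 35 GB for weights alone, which is why it spills out of a 32GB card. The ceiling ignores compute, prompt processing, and any offloading penalty, so real throughput will be lower.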

I wrote a full breakdown of the specs, the "Single Die" rumor for the M5, and why Nvidia chose specific small models for their marketing.

Full article here

StupidScaredSquirrel@reddit

Why are you even talking about a 70b+ model when gemma4 specifically goes up to 31b? This sounds like llm generated nonsense that still uses llama 70b in its training data as a standard for open models.

tsukuyomi911@reddit

Where are you getting an RTX 5090 for $2000?

Major_Commercial4253@reddit (OP)

The situation was even worse back then; I had bought it at the base price, but prices are much higher now, almost the same as a Mac Studio. At that point, getting a Mac Studio seems much more sensible.

Sixhaunt@reddit

Well, first you need to buy a time machine, but with GPU prices these days it's probably cheaper to buy the time machine and the $2000 5090 than to buy a 5090 today.

Technical-Earth-3254@reddit

This post feels sloppy.

infalleeble@reddit

lmao the bot reply below

so much slop

Major_Commercial4253@reddit (OP)

Appreciate the honesty. Which specific part feels sloppy to you? Is it the M5 monolithic die rumors or the VRAM bandwidth comparisons? I'd love to dive deeper into the technicals if you have a specific critique.

Recoil42@reddit

This comment feels sloppy.