RTX Spark will have up to 600GB/s of memory bandwidth.
Posted by fallingdowndizzyvr@reddit | LocalLLaMA | View on Reddit | 65 comments
We just assumed that since it's a GB10 variant that it would have the same memory bandwidth as DGX Spark, 273GB/s. But it's reported that it will have double that, 600GB/s.
"The unified memory architecture brings up to 128GB of LPDDR5X RAM with a bandwidth of 600GB/s"
https://wccftech.com/nvidia-enters-pc-space-with-rtx-spark/
"its memory bandwidth peaks at 600 GB/s"
https://www.notebookcheck.net/Nvidia-N1X-officially-confirmed-to-arrive-as-the-RTX-Spark.1312010.0.html
nostriluu@reddit
It could still be interesting if it supports oculink. Which I'd think Nvidia would want for a PC + eGPU sale focused on the CUDA ecosystem.
fallingdowndizzyvr@reddit (OP)
It doesn't even need to have Oculink. Since USB4/TB4 eGPU enclosures also work just fine.
nostriluu@reddit
"The primary technical advantage of OCuLink over USB4 and Thunderbolt 4 is the complete absence of protocol encapsulation and abstraction layers. USB4 and Thunderbolt 4 rely on internal controllers to package native PCIe data into USB packets, transport those packets over the cable, and then unpack them back into PCIe data at the receiving end, a process that introduces significant transmission latency and processing overhead while capping PCIe data allocation at a maximum of 32 Gbps. In contrast, OCuLink functions as a direct, unshielded physical extension of the motherboard's PCIe bus over copper wire, maintaining native, raw PCIe 4.0 x4 signaling at 64 Gbps without packet modification, thereby providing the external device with direct memory access to the host processor at minimum latency."
Will the system support hybrid AI though?
joes-tech-adventures@reddit
It will have 300 GB/s of memory bandwidth and the 600 GB/s figure is for the NVLink C2C interconnect that bridges the CPU with the GPU.
fallingdowndizzyvr@reddit (OP)
Then Nvidia really should not be listing that as the fine print under the 128GB of memory then. Since it really doesn't have anything to do with memory does it?
tmvr@reddit
Seems pretty clear to me:
https://www.techpowerup.com/349554/nvidia-announces-rtx-spark-a-supercomputer-grade-processor-for-windows-pcs-with-agentic-user-interfaces
fallingdowndizzyvr@reddit (OP)
Have a look at this slide. "600 GB/s" right under "128 GB Unified Memory".
https://www.notebookcheck.net/fileadmin/processed/a/2/csm_RTX-Spark-specs_dd5b710e5c.jpg
tmvr@reddit
It doesn't say "600 GB/s", it says "600 GB/s NVLink C2C", pretty clear what it is referring to.
fallingdowndizzyvr@reddit (OP)
Which has nothing to do with memory bandwidth. Which has nothing to do with "128 GB Unified Memory". So why is it the fine print under "128 GB Unified Memory"? Look at all the other bullet items on that slide. All the other fine prints under the bullet items relate to that bullet item.
tmvr@reddit
It's not a fine print. Also, why are you ignoring the actual official slide from the deck that I pasted in? Nothing is hidden, you're chasing ghosts.
fallingdowndizzyvr@reddit (OP)
LOL. That is an official slide I posted. Why are you ignoring it?
srigi@reddit
My friend, ask 10 kids at elementary school, how do they interpret numbers on image and let us know why they are, according to you, wrong.
fallingdowndizzyvr@reddit (OP)
My fool, that's not the slide I'm talking about. This is.
https://www.notebookcheck.net/fileadmin/processed/a/2/csm_RTX-Spark-specs_dd5b710e5c.jpg
Which you would know if you knew how to read. Ask 10 kids in elementary school to teach you how to do that. I'm sure they would be happy to teach you.
You looked at the wrong slide. That was not the slide being discussed.
entsnack@reddit
I think it's pretty clear TBH and you misread.
fallingdowndizzyvr@reddit (OP)
Really. Explain how I misread this.
"The unified memory architecture brings up to 128GB of LPDDR5X RAM with a bandwidth of 600GB/s"
Explain to me the "600 GB/s" right under "128 GB Unified Memory" in this slide.
https://www.techpowerup.com/349554/nvidia-announces-rtx-spark-a-supercomputer-grade-processor-for-windows-pcs-with-agentic-user-interfaces
Look at all the other bullet items. The fine print under each one has to do with the bullet item. NVlink doesn't have anything to do with the 128 GB of Unified Memory.
rerri@reddit
Wccftech is not a very reliable source of information. They constantly get shit wrong.
entsnack@reddit
I dunno man I think this stuff is for a more technical audience and you should just watch a YouTube video about it instead.
fallingdowndizzyvr@reddit (OP)
Ah... you have to watch a YouTube video to learn how to read this sentence.
"The unified memory architecture brings up to 128GB of LPDDR5X RAM with a bandwidth of 600GB/s"
Dude, if that's what it takes. Please do so.
sn2006gy@reddit
I doubt this is true, if so, there will be LOTS of pissed off Spark Users!
pmttyji@reddit
Price is gonna be around $6-7K probably so less chance.
Snoo_81913@reddit
OOF 6-7 grand and the leaked benchmarks right now are saying it's about the same as an M3 Max setup. Is that accurate or did I read that wrong?
sn2006gy@reddit
at 600gb/s it would be worth \~5k but again, i think people are mixing the aggregate synchronous bandwidth as pure unidirectional throughput.
Virtual_Magazine_860@reddit
Why do people buy these? Because they are pretty?
entsnack@reddit
if you have to ask...
Zeeplankton@reddit
I literally cannot imagine them bother to sell these at 6-7k for a 128gb config. That would be utterly laughable.
Snoo_81913@reddit
Early adopter tax. 😅🤣
entsnack@reddit
It's a power-limited mobile chip in a Windows laptop lol, I think w'll be fine. 😄 Early adopters got the Spark at $4,000 per.
Deep90@reddit
I'll be disappointed if the spark users never are.
The spark is not nearly enough.
fallingdowndizzyvr@reddit (OP)
It's right there on a Nvidia slide during the presentation.
https://www.notebookcheck.net/fileadmin/processed/a/2/csm_RTX-Spark-specs_dd5b710e5c.jpg
Look_0ver_There@reddit
That's C2C (Core2Core) NVLink, not memory bandwidth
fallingdowndizzyvr@reddit (OP)
Nvidia really should not be listing that as the fine print under the 128GB of memory then. Since it really doesn't have anything to do with memory does it?
sn2006gy@reddit
like i said, it doesn't change anything. if this is true - and it isn't the "Aggregate. bandwidth available" but is actually 600gb/s, there will be lots of pissed off spark users.
EbbNorth7735@reddit
Technology can't improve!
FullstackSensei@reddit
Not like this. This is a rebadged and GB10 with different power settings. The GB10 is already running at 8533MT. Even if you push it to 9500MT, that would barely push memory bandwidth past 300GB/s (up from 273).
Pushing to 600GB/s would require 8 memory channels at 9500MT up from 4 in the GB10. That's M5 Max territory, which is a much bigger chunk of silicon.
The memory interface in the GB10 is on the CPU side (which is designed by Mediathek). That chip is way too small to have that many memory channels. Otherwise, why neuter the DGX Spark to four channels when it has a much higher power budget?
sn2006gy@reddit
oh it can improve, but that doesn't mean it won't piss off those spark users who paid a MASSIVE premium for inferior product.
munkiemagik@reddit
Am I being a moron here? I don't understand what all the RTX Spark hype is suddenly. Aren't the DGX and RTX actually in fact the same thing? The DGX already had 600GB/s NVLINK C2C. So exact same GB10 block except
So this is a big fat nothing burger, just a repackaging and marketing pivot?
carrot_gg@reddit
OP is retarded.
rpkarma@reddit
People can’t read, and they’re telling on themselves with posts like the one we’re replying to lol
SurpriseOk6927@reddit
600gb/s on consumer hardware changes the local LLM game bandwidth not raw compute is the real bottleneck for decoding this changes what you can run at home
fallingdowndizzyvr@reddit (OP)
Unfortunately it's not true. It's just a confusing slide led to articles writing something not true. It's no different from DGX Spark.
mjsxi__@reddit
y'all really need to learn how to read.
fallingdowndizzyvr@reddit (OP)
Please educate us on how you read this then.
"The unified memory architecture brings up to 128GB of LPDDR5X RAM with a bandwidth of 600GB/s"
mjsxi__@reddit
again. y'all really need to learn how to read.
fallingdowndizzyvr@reddit (OP)
Again, I'm still waiting for you to explain how you read that.
entsnack@reddit
> ... With NVlink ...
fallingdowndizzyvr@reddit (OP)
Where does it say "With NVlink" in that sentence?
entsnack@reddit
Here you go, a screenshot from your notebookcheck link with "With NVlink" highlighted.
fallingdowndizzyvr@reddit (OP)
Except that sentence isn't from notebookcheck. It's from here.
"The unified memory architecture brings up to 128GB of LPDDR5X RAM with a bandwidth of 600GB/s"
https://wccftech.com/nvidia-enters-pc-space-with-rtx-spark/
Again. Where does it say "With NVlink" in that sentence?
munkiemagik@reddit
To be fair to u/fallingdowndizzyvr they did read good, its just wccftech wrote bad!
The article does actually say it in all its glorious wrongness...
"...The unified memory architecture brings up to 128GB of LPDDR5X RAM with a bandwidth of 600GB/s. with five lanes of PCIe Gen 5.
This memory architecture is enabled by NVIDIA’s NVLink C2C, which is similar to Apple’s UltraFusion..."
So shame on wccftech.
Even the parity to Apple UltraFusion is waffle, thats die to die, ie jamming two M Max together to make an M Ultra. The NVL C2C is interconnecting CPU and GPU blocks to share address space.
UF is bonkers dense parallel on silicon and C2C is much less dense serial across substrate. (this stuff is way beyond me I'm just reading around and interpreting out of curiosity because the 600GB/s title piqued my interest thinking I could finally dump the big box multi GPU LLM server so if I've said anything incorrect someone please do come along and slap my wrist and correct me, oh well back to the old drawing board)
mjsxi__@reddit
nah theyre up and down this tread intentionally being obtuse — if they weren't acting foolish I'd be inclined to agree that it was worded poorly but clearly they don't want to understand.
fallingdowndizzyvr@reddit (OP)
LOL. Says the one that's intentionally being obtuse and doesn't want to understand.
Again, I'm still waiting for you to explain how you read this.
"The unified memory architecture brings up to 128GB of LPDDR5X RAM with a bandwidth of 600GB/s"
Not a different sentence. That sentence. You know, the sentence you've been dodging.
mjsxi__@reddit
its been explained to you 100x over bother someone else
fallingdowndizzyvr@reddit (OP)
And you've dodge explaining how you read that sentence over and over and over again. Not what you think it should say. But what it actually says.
"The unified memory architecture brings up to 128GB of LPDDR5X RAM with a bandwidth of 600GB/s"
putrasherni@reddit
M5 max is 600 or 800 GB/s ?
pmttyji@reddit
So it seems it's gonna take 1-2 years for 256/512 GB variants.
AMD's 192GB variant's release on 2027.
M5 Mac studio's release (rumored) scheduled on October, else 2027.
mindwip@reddit
Um I thought amd just said at comouterx 192gb version is coming pre-orders in June.
2027 is Medusa halo,
Aroochacha@reddit
No? The listed bandwidth is 300 GB/s. This is talking about networking two of them though I don't see how as units I've seen or read about are missing the connect-x7 ports. So far.
fallingdowndizzyvr@reddit (OP)
Exactly, as I said in another thread. They must mean that's 600GB/s of aggregate bandwidth across 2 Sparks.
JustASheepInTheFlock@reddit
128 G is not enough for windows+wsl+crowd-strike+anti-virus+backdoors
EveningIncrease7579@reddit
300 GB/s for you beta testers, 600 GB/s for stable release.
g_rich@reddit
It's 600GB/s for NVLink C2C which is the same as the DGX Spark.
tengo_harambe@reddit
what about the PP
marutthemighty@reddit
Is this ready for release? Or is it beta testing? Or is it still 1-2 years in the near future?
ImportancePitiful795@reddit
It will be funny if it is at same price with Spark 🤣
FlamaVadim@reddit
sure...