From 4090 to 5090 to RTX PRO 6000… in record time

Posted by Fabix84@reddit | LocalLLaMA | View on Reddit | 258 comments

From 4090 to 5090 to RTX PRO 6000… in record time

Started with a 4090, then jumped to a 5090… and just a few weeks later I went all in on an RTX PRO 6000 with 96 GB of VRAM. I spent a lot of time debating between the full power and the Max-Q version, and ended up going with Max-Q.

It’s about 12–15% slower at peak than the full power model, but it runs cooler, pulls only 300W instead of 600W, and that means I can add a second one later without melting my power supply or my room. Given how fast I went from 4090 → 5090 → RTX PRO 6000, there’s a real chance I’ll give in to the upgrade itch again sooner than I should.

I almost pre-ordered the Framework board with the AMD AI Max+ 395 and 128 GB unified RAM, but with bandwidth limited to 256 GB/s it’s more of a fun concept than a serious AI workhorse. With the RTX PRO 6000, I think I’ve got the best prosumer AI hardware you can get right now.

The end goal is to turn this into a personal supercomputer. Multiple local AI agents working 24/7 on small projects (or small chunks of big projects) without me babysitting them. I just give detailed instructions to a “project manager” agent, and the system handles everything from building to testing to optimizing, then pings me when it’s all done.