Recommendations for best usage of current resources

Posted by decrement--@reddit | LocalLLaMA | View on Reddit | 3 comments

Been reading papers on Magentic-One, Llama, Phi-4, etc. Really interested in the Magentic-One (Multi Agentic approach), and have some hardware to play around with. Please help me choose an ideal setup. Hardware that I have: 2x3090 1x2080Ti 1x970 (probably useless now) 1xK80 (also useless now) Computers: Intel i9-10900KF, 2x16GB DDR4, 2TB NVMe Ryzen 5700X, 4x8GB DDR4, 1TB NVMe, 500GB SSD (NAS) R730xd 2x12 Core E5-2678V3 (2.5GHz), 128GB DDR4, ~32TB HDD storage, 2x128GB SSD I am thinking I will put the 2x3090 in the intel machine, with NVLink, and try to run the 70b models in 4bit. I can use the 2080Ti in the AMD machine, running an 11B model. Overall, my goal is to fork Magentic-One, allowing for individually configurable agents with different LLMs. So if you were in my shoes, what models would you choose, and how would you leverage this? Right now I don't see myself training much more than a LoRA, and my goal is to have an LLM system capable of Software Project planning, code/repo surfing, and some code generation. Finally, what would your growth plan be after this? Move towards a single machine and more cards?

3 Comments

[-]

Thireus@reddit

Placing 2x3090 and 1x2080Ti in one single computer would allow for more flexibility because the VRAM would be combined and you can distribute it across models as you see fit. Use razer cables to make the GPUs fit all PCIe slots. NVLink is not a requirement. But that's just my opinion.

decrement--@reddit (OP)

Thanks

Reply to Post

3 Comments