Recommendations for best usage of current resources
Posted by decrement--@reddit | LocalLLaMA | View on Reddit | 3 comments
Been reading papers on Magentic-One, Llama, Phi-4, etc. Really interested in the Magentic-One (Multi Agentic approach), and have some hardware to play around with. Please help me choose an ideal setup.
Hardware that I have:
2x3090
1x2080Ti
1x970 (probably useless now)
1xK80 (also useless now)
Computers:
Intel i9-10900KF, 2x16GB DDR4, 2TB NVMe
Ryzen 5700X, 4x8GB DDR4, 1TB NVMe, 500GB SSD
(NAS) R730xd 2x12 Core E5-2678V3 (2.5GHz), 128GB DDR4, ~32TB HDD storage, 2x128GB SSD
I am thinking I will put the 2x3090 in the intel machine, with NVLink, and try to run the 70b models in 4bit. I can use the 2080Ti in the AMD machine, running an 11B model.
Overall, my goal is to fork Magentic-One, allowing for individually configurable agents with different LLMs.
So if you were in my shoes, what models would you choose, and how would you leverage this? Right now I don't see myself training much more than a LoRA, and my goal is to have an LLM system capable of Software Project planning, code/repo surfing, and some code generation.
Finally, what would your growth plan be after this? Move towards a single machine and more cards?
3 Comments
Thireus@reddit
decrement--@reddit (OP)
decrement--@reddit (OP)