I need help building a 96 GB VRAM setup
Posted by emrecengdev@reddit | buildapc | 6 comments
Hello everyone
I am an intern on a software development team.
The company liked the OCR system I developed myself (it uses deep learning models), as well as the projects I built with the Gemini API.
Unfortunately, the best GPU server I can use in the office right now has an RTX 3050 Ti. Its VRAM is far too small for training deep learning models or for running a local language model. (We want to work locally for data privacy and cost reasons.)
From what I have researched, a 4 x 3090 setup is one of the best budget choices for me. However, I am having trouble finding a motherboard and processor.
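A rough sketch of the VRAM arithmetic behind the 4 x 3090 idea. The 24 GB per card is the RTX 3090's memory size; the 70-billion-parameter model and the function names are hypothetical, chosen only for illustration. The estimate covers model weights only and ignores activations, KV cache, and optimizer state, so real usage will be higher.

```python
GPU_VRAM_GB = 24  # RTX 3090 memory per card


def total_vram_gb(num_gpus, vram_per_gpu_gb=GPU_VRAM_GB):
    """Combined VRAM across all cards (it is not one contiguous pool)."""
    return num_gpus * vram_per_gpu_gb


def weights_gb(params_billion, bytes_per_param):
    """Approximate size of the model weights alone, in GB."""
    return params_billion * bytes_per_param


if __name__ == "__main__":
    budget = total_vram_gb(4)
    print(f"Total VRAM: {budget} GB")
    # Hypothetical 70B-parameter model at different precisions.
    for label, bytes_per_param in [("fp16", 2.0), ("int8", 1.0), ("4-bit", 0.5)]:
        need = weights_gb(70, bytes_per_param)
        verdict = "fits" if need < budget else "does not fit"
        print(f"{label}: ~{need:.0f} GB of weights, {verdict} in {budget} GB")
```

Training usually needs several times the weight size because of gradients and optimizer state, so a model that fits for inference may still not fit for training.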
For a system with 4 cards (a motherboard with more slots would be even better), do all PCIe slots need to run at x16? Some sources say inference speed drops by at most 10% when x16 and x8 slots are mixed, while others say it drops by close to 50%.
For my use case, do all slots have to be x16? Or would a mixed x16/x8 configuration cost little performance?
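For reference, here is a minimal sketch of the theoretical one-direction link bandwidth for x16 versus x8. It assumes the per-lane transfer rates and 128b/130b encoding of PCIe 3.0 through 5.0 (the 3090 is a PCIe 4.0 card); the function name is made up for this example, and real-world throughput is lower than these peak figures.

```python
# Per-lane raw transfer rate in GT/s for each PCIe generation.
TRANSFER_RATE_GT_S = {3: 8.0, 4: 16.0, 5: 32.0}
ENCODING_EFFICIENCY = 128 / 130  # 128b/130b line encoding (Gen 3 and later)


def pcie_bandwidth_gb_s(gen, lanes):
    """Theoretical peak bandwidth in GB/s, one direction."""
    per_lane_gb_s = TRANSFER_RATE_GT_S[gen] * ENCODING_EFFICIENCY / 8
    return per_lane_gb_s * lanes


if __name__ == "__main__":
    for lanes in (16, 8, 4):
        print(f"PCIe 4.0 x{lanes}: {pcie_bandwidth_gb_s(4, lanes):.2f} GB/s")
```

Halving the link width halves the peak bus bandwidth, but the slowdown you actually see depends on how much data crosses the bus. That would be consistent with different sources reporting anything from a small loss to a large one, depending on the workload.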
Motherboards with 4 x16 PCIe slots are usually server boards. The company does not buy second-hand hardware (and some new processors are not stocked in my country). These boards pair with processors such as the latest Threadripper, which pushes the cost beyond what the company will accept.
There is also the NVLink scenario: if two 3090s running in x16 and x8 slots are connected with NVLink, will the performance loss be avoided? (If this scenario works, I can use motherboards compatible with processors such as the i9 that host 2 x16 x8 x8.)
Comfortable-Mine3904@reddit
You need to look at workstation or server-level hardware for this.
AMD's Threadripper series should do this.
emrecengdev@reddit (OP)
Yes, I realise that. My biggest limitation is not being able to use second-hand equipment, and the new-generation server processors are way over budget.
Comfortable-Mine3904@reddit
This project probably can't be done at your price point if that's the case.
3090s themselves are really only a great deal because of the used market.
emrecengdev@reddit (OP)
I persuaded the company to allow second-hand hardware. I am now researching systems online, and I keep running into new things, for example vendor-locked AMD EPYC processors. If I had built a system without learning about this, the result could have been frustrating. Apart from that, are there any other points I should pay attention to?
Verdreht@reddit
LGA1700 CPUs have 20 PCIe lanes (+4 chipset). AM5 CPUs have 24 PCIe lanes (+4 chipset). There's no way you're running a quad x16 or quad x8 setup on either of these platforms; the best you can do is quad x4. You're going to have to look into server equipment.
emrecengdev@reddit (OP)
So I need a CPU with at least 64 PCIe lanes. Thank you.
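The lane arithmetic in this exchange can be sketched as follows. The lane counts for LGA1700 and AM5 are the ones quoted above; the "64-lane CPU" entry is a hypothetical placeholder, not a specific product, and the function name is made up for this example. The sketch assumes every GPU gets the same link width and ignores lanes taken by NVMe drives or limited by the motherboard's slot wiring.

```python
LINK_WIDTHS = (16, 8, 4, 1)  # link widths considered, widest first

# CPU lane counts as quoted in the thread; the last entry is hypothetical.
PLATFORM_CPU_LANES = {
    "LGA1700": 20,
    "AM5": 24,
    "64-lane CPU (hypothetical)": 64,
}


def widest_uniform_link(cpu_lanes, num_gpus):
    """Widest link every GPU can get at once, or None if even x1 will not fit."""
    for width in LINK_WIDTHS:
        if width * num_gpus <= cpu_lanes:
            return width
    return None


if __name__ == "__main__":
    print(f"Lanes needed for 4 GPUs at x16: {4 * 16}")
    for platform, lanes in PLATFORM_CPU_LANES.items():
        width = widest_uniform_link(lanes, num_gpus=4)
        print(f"{platform}: {lanes} lanes -> quad x{width}")
```

With these numbers, both consumer platforms come out at quad x4, and quad x16 needs 4 x 16 = 64 CPU lanes.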