Building a local RAG server

Posted by autonom1a@reddit | LocalLLaMA | View on Reddit | 15 comments

Hi. Corporate wants me to build a local RAG server. 50-100 concurrent interactions with the model few times a day at the first stage and 100-1000 when deployed to production.

I want to understand the hardware stack and its price. Maybe options.

Halp.