Building a Mini PC for aya-expanse-8b Inference - Recommendations Needed!
Posted by Whiplashorus@reddit | LocalLLaMA | 35 comments
Hello everyone, I'm an AI enthusiast looking to build a mini PC dedicated to AI inference, mainly for machine translation of novels and light novels. I recently discovered the Aya-Expanse-8B model, which performs exceptionally well at English-to-French translation.
My goal is a mini PC that delivers fast, energy-efficient inference and can load models from 8B to 27B (up to Gemma2-27B). I'm aiming for at least 40-50 tokens per second on Aya-Expanse-8B so that I can translate novels and light novels efficiently.
I'm aware that RAM bandwidth and GPU memory bandwidth are key factors for AI inference, so I'm looking for recommendations on the following components:
* A CPU with an iGPU or NPU suited to AI inference. I don't know much about NPUs, but I'm wondering whether one would let me run models at a usable speed. Can you tell me the pros and cons of NPUs for AI inference?
* High-bandwidth RAM to support large AI models. I've heard of the Smokeless-UMAF GitHub project, which allows a large amount of RAM to be allocated to the iGPU as VRAM. Could this be a good solution for my configuration?
* Any other components that could affect AI inference performance.
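Since memory bandwidth is named above as the key factor, here is a rough back-of-the-envelope sketch of how the 40-50 tokens per second target translates into a bandwidth requirement. It rests on a common rule of thumb, not a measurement: token generation is assumed to be memory-bound, with every weight read once per generated token, so the ceiling is roughly bandwidth divided by model size. The nominal 8B parameter count, the 4.5 bits per weight (a typical 4-bit quantization with overhead), and the dual-channel DDR5-5600 configuration are illustrative assumptions, and the function names are hypothetical. Real throughput will be lower because of the KV cache, compute limits, and memory that never reaches its theoretical peak.

```python
def model_size_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate weight footprint in GB (1 GB = 1e9 bytes).

    Ignores the KV cache, activations, and runtime overhead.
    """
    return params_billion * bits_per_weight / 8


def required_bandwidth_gb_s(target_tps: float, params_billion: float,
                            bits_per_weight: float) -> float:
    """Bandwidth needed if every weight is read once per generated token."""
    return target_tps * model_size_gb(params_billion, bits_per_weight)


def max_tokens_per_second(bandwidth_gb_s: float, params_billion: float,
                          bits_per_weight: float) -> float:
    """Upper bound on decode speed for a memory-bound model."""
    return bandwidth_gb_s / model_size_gb(params_billion, bits_per_weight)


def ddr_peak_bandwidth_gb_s(channels: int, mt_per_s: int) -> float:
    """Theoretical peak for 64-bit (8-byte) DDR channels."""
    return channels * 8 * mt_per_s / 1000


if __name__ == "__main__":
    # Assumed figures: nominal 8B parameters, ~4.5 bits per weight.
    for tps in (40, 50):
        need = required_bandwidth_gb_s(tps, 8, 4.5)
        print(f"{tps} tok/s on an 8B model needs roughly {need:.0f} GB/s")

    # Assumed platform: dual-channel DDR5-5600 shared by CPU and iGPU.
    ddr5 = ddr_peak_bandwidth_gb_s(channels=2, mt_per_s=5600)
    for params in (8, 27):
        ceiling = max_tokens_per_second(ddr5, params, 4.5)
        print(f"{params}B at {ddr5:.1f} GB/s: at most ~{ceiling:.1f} tok/s")
```

Under these assumptions the target needs on the order of 180-225 GB/s, while dual-channel DDR5-5600 peaks at 89.6 GB/s, so it is worth comparing the memory bandwidth of any candidate machine against that figure before looking at the CPU, iGPU, or NPU.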
I'm also looking for mini PCs with good cooling, as I plan to run the system for extended periods (4 to 8 hours continuously). Can you recommend any mini PCs with efficient cooling systems? I'd be glad to hear your recommendations for building a mini PC dedicated to AI inference. Thanks to the community for your advice and experience!
35 Comments
Downtown-Case-1755@reddit
zerostyle@reddit
Downtown-Case-1755@reddit
schlammsuhler@reddit
zerostyle@reddit
schlammsuhler@reddit
zerostyle@reddit
matadorius@reddit
Whiplashorus@reddit (OP)
matadorius@reddit
Whiplashorus@reddit (OP)
Mysterious_Finish543@reddit
zerostyle@reddit
NEEDMOREVRAM@reddit
Mysterious_Finish543@reddit
SignificantDress355@reddit
Whiplashorus@reddit (OP)
MoffKalast@reddit
matadorius@reddit
zerostyle@reddit
FullOf_Bad_Ideas@reddit
Whiplashorus@reddit (OP)
chrd5273@reddit
Whiplashorus@reddit (OP)
chrd5273@reddit
isr_431@reddit
Whiplashorus@reddit (OP)
isr_431@reddit
Whiplashorus@reddit (OP)
clduab11@reddit
Whiplashorus@reddit (OP)
clduab11@reddit
loadsamuny@reddit
ICanSeeYou7867@reddit
Whiplashorus@reddit (OP)