EmilPi

Which (1 or 2-story) frame to use for 7 GPU rig?

Posted by EmilPi@reddit | LocalLLaMA | View on Reddit | 5 comments
Anyone using RTX 8000 (48GB) or MI100 (32GB) cards for LLM inference?

Posted by EmilPi@reddit | LocalLLaMA | View on Reddit | 44 comments
How to estimate prompt processing speed for given (multi-)GPU and model?

Posted by EmilPi@reddit | LocalLLaMA | View on Reddit | 2 comments
How to run Gemma 3 27B QAT with 128k context window with 3 parallel requests possible on 2x3090

Posted by EmilPi@reddit | LocalLLaMA | View on Reddit | 6 comments
5 commands to run Qwen3-235B-A22B Q3 inference on 4x3090 + 32-core TR + 192GB DDR4 RAM

Posted by EmilPi@reddit | LocalLLaMA | View on Reddit | 37 comments
How to use Qwen2.5-Coder-Instruct without frustration in the meantime

Posted by EmilPi@reddit | LocalLLaMA | View on Reddit | 32 comments
Is the RTX 50xx series intentionally locked for compute/AI?

Posted by EmilPi@reddit | LocalLLaMA | View on Reddit | 19 comments
2x RTX 3090 + Threadripper 3970X + 256GB RAM LLM inference benchmarks

Posted by EmilPi@reddit | LocalLLaMA | View on Reddit | 57 comments
What is the best small long-context open-weight model now?

Posted by EmilPi@reddit | LocalLLaMA | View on Reddit | 15 comments
PCIe splitter advice

Posted by EmilPi@reddit | LocalLLaMA | View on Reddit | 6 comments
A4500 (20 GB VRAM) + RTX 4060 Ti (16 GB) dual GPU setup

Posted by EmilPi@reddit | buildapc | View on Reddit | 1 comment
Dual RTX 4070 Ti Super vs RTX 4090

Posted by EmilPi@reddit | buildapc | View on Reddit | 2 comments
Run DeepSeek-V3 with 96GB VRAM + 256 GB RAM under Linux

Posted by EmilPi@reddit | LocalLLaMA | View on Reddit | 27 comments
Any Pixtral-Large-2411 quants out there?

Posted by EmilPi@reddit | LocalLLaMA | View on Reddit | 7 comments
How to simply run your model at startup in Debian/Ubuntu

Posted by EmilPi@reddit | LocalLLaMA | View on Reddit | 8 comments
4x RTX 3090 + Threadripper 3970X + 256 GB RAM LLM inference benchmarks

Posted by EmilPi@reddit | LocalLLaMA | View on Reddit | 49 comments
LLM 4 GPUs rig stability problem

Posted by EmilPi@reddit | LocalLLaMA | View on Reddit | 22 comments
4 GPUs stability problem

Posted by EmilPi@reddit | buildapc | View on Reddit | 0 comments
Code LLMs with long context window

Posted by EmilPi@reddit | LocalLLaMA | View on Reddit | 32 comments
How to know RAM Threaded Read Speed in advance given RAM module, stick count, mobo's RAM channels count and CPU's PCIe lanes count and gen?

Posted by EmilPi@reddit | buildapc | View on Reddit | 3 comments
Re: asked LLMs a riddle about Sally and her sisters + plot twist

Posted by EmilPi@reddit | LocalLLaMA | View on Reddit | 1 comment