d00m_sayer

vLLM prefix caching cannot be used with Qwen 3.5 27B?

Posted by d00m_sayer@reddit | LocalLLaMA | View on Reddit | 6 comments
Dissatisfied with how the RTX PRO 6000 Blackwell is performing during AI inference

Posted by d00m_sayer@reddit | LocalLLaMA | View on Reddit | 63 comments
Is there any feasible modification that would allow an RTX 6000 to support an NVLink bridge?

Posted by d00m_sayer@reddit | LocalLLaMA | View on Reddit | 17 comments
Question about "./llama-server" prompt caching

Posted by d00m_sayer@reddit | LocalLLaMA | View on Reddit | 16 comments
Can PCIe x16 Gen4 SlimSAS 8i x2 adapters be powered by a second PSU, or do they need the same PSU that powers the motherboard?

Posted by d00m_sayer@reddit | LocalLLaMA | View on Reddit | 12 comments
Can PCIe x16 Gen4 SlimSAS 8i x2 adapters be powered by a second PSU, or do they need the same PSU that powers the motherboard?

Posted by d00m_sayer@reddit | buildapc | View on Reddit | 1 comment
Powering a Rig with Mixed PSUs

Posted by d00m_sayer@reddit | LocalLLaMA | View on Reddit | 4 comments
How do I supply power to a PCIe x16 to x8x8 bifurcation card?

Posted by d00m_sayer@reddit | LocalLLaMA | View on Reddit | 7 comments
llama.cpp is unusable for real work

Posted by d00m_sayer@reddit | LocalLLaMA | View on Reddit | 19 comments
Tensor Parallelism issues

Posted by d00m_sayer@reddit | LocalLLaMA | View on Reddit | 3 comments
How to make DeepSeek-R1-Distill-Llama-70B Model reason?

Posted by d00m_sayer@reddit | LocalLLaMA | View on Reddit | 2 comments
Extreme heat in Saudi Arabia causes a gas station's gasoline storage tanks to burst dramatically.

Posted by d00m_sayer@reddit | collapse | View on Reddit | 1 comment