LM Studio CPU thread pool size vs. tk/s with some MoE layers offloaded to CPU
Posted by bonobomaster@reddit | LocalLLaMA | View on Reddit | 42 comments
Posted by bonobomaster@reddit | LocalLLaMA | View on Reddit | 42 comments
42 Comments
Plastic-Stress-6468@reddit
Clear-Ad-9312@reddit
Plastic-Stress-6468@reddit
tony__Y@reddit
FatheredPuma81@reddit
SnooPaintings8639@reddit
KageYume@reddit
mlhher@reddit
bonobomaster@reddit (OP)
Zc5Gwu@reddit
Plastic-Stress-6468@reddit
bonobomaster@reddit (OP)
bonobomaster@reddit (OP)
GreaterThanLess@reddit
bonobomaster@reddit (OP)
aaronr_90@reddit
bonobomaster@reddit (OP)
aaronr_90@reddit
bonobomaster@reddit (OP)
aaronr_90@reddit
denoflore_ai_guy@reddit
bonobomaster@reddit (OP)
denoflore_ai_guy@reddit
usuallyalurker11@reddit
bonobomaster@reddit (OP)
usuallyalurker11@reddit
Clear-Ad-9312@reddit
Wetbikeboy2500@reddit
Equivalent_Job_2257@reddit
bonobomaster@reddit (OP)
Equivalent_Job_2257@reddit
Iory1998@reddit
gigaflops_@reddit
MmmmMorphine@reddit
bonobomaster@reddit (OP)
_VirtualCosmos_@reddit
eesnimi@reddit
mp3m4k3r@reddit
bonobomaster@reddit (OP)
dreamai87@reddit
bonobomaster@reddit (OP)
moahmo88@reddit