createthiscom

PC Build: Run Deepseek-V3-0324:671b-Q8 Locally 6-8 tok/s

Posted by createthiscom@reddit | LocalLLaMA | View on Reddit | 173 comments
native tool calling support for DeepSeek V3.1 just merged in llama.cpp

Posted by createthiscom@reddit | LocalLLaMA | View on Reddit | 10 comments
The dangers of local LLMs: Sleeper Agents

Posted by createthiscom@reddit | LocalLLaMA | View on Reddit | 24 comments
samsung 9100 pro 4tb vs WD_BLACK 8TB SN850X for llama.cpp

Posted by createthiscom@reddit | LocalLLaMA | View on Reddit | 25 comments
What's your favorite local model for C#?

Posted by createthiscom@reddit | LocalLLaMA | View on Reddit | 11 comments
large model ( 547G ) load time - llama.cpp

Posted by createthiscom@reddit | LocalLLaMA | View on Reddit | 1 comments
This is awkward

Posted by createthiscom@reddit | LocalLLaMA | View on Reddit | 188 comments
more than 131k context on a single GPU - llama.cpp

Posted by createthiscom@reddit | LocalLLaMA | View on Reddit | 3 comments
Why does HF not show total size for directories?

Posted by createthiscom@reddit | LocalLLaMA | View on Reddit | 4 comments
DIY LLM inference engine learning

Posted by createthiscom@reddit | LocalLLaMA | View on Reddit | 12 comments
Does your AI need help writing unified diffs?

Posted by createthiscom@reddit | LocalLLaMA | View on Reddit | 5 comments
ktransformers: DeepSeek_V3_0324:671b-Q4_K_M - 14 tok/s - Open Hands AI

Posted by createthiscom@reddit | LocalLLaMA | View on Reddit | 0 comments