jwpbe
-
Buried lede: Deepseek v4 Flash is incredibly inexpensive from the official API for its weight category
Posted by jwpbe@reddit | LocalLLaMA | View on Reddit | 64 comments
-
InclusionAI's 103B MoE's Ring-Flash 2.0 (Reasoning) and Ling-Flash 2.0 (Instruct) now have GGUFs!
Posted by jwpbe@reddit | LocalLLaMA | View on Reddit | 11 comments
-
tool calling support was merged into ik_llama last week
Posted by jwpbe@reddit | LocalLLaMA | View on Reddit | 3 comments