Dany0
-
(YT) PewDiePie released his harness/webui
Posted by Dany0@reddit | LocalLLaMA | 450 comments
-
Important (vision) Qwen3.5 template fix dropped in vllm
Posted by Dany0@reddit | LocalLLaMA | 2 comments
-
attn-rot (TurboQuant-like KV cache trick) lands in llama.cpp
Posted by Dany0@reddit | LocalLLaMA | 28 comments
-
attn-rot (ggerganov's "TurboQuant lite") is on the cusp of getting merged into llama.cpp
Posted by Dany0@reddit | LocalLLaMA | 67 comments
-
First Qwen3-Coder-Next REAP is out
Posted by Dany0@reddit | LocalLLaMA | 78 comments