charmander_cha
-
Offloading to SSD + speculative decoding via diffusion = real deal?
Posted by charmander_cha@reddit | LocalLLaMA | View on Reddit | 8 comments
-
The search problem has been solved - SemanticFileFinder (sff)
Posted by charmander_cha@reddit | LocalLLaMA | View on Reddit | 3 comments
-
Has anyone tried to use TextGrad with Ollama? (TextGrad rewrites your prompt to make it more accurate, or something along those lines)
Posted by charmander_cha@reddit | LocalLLaMA | View on Reddit | 4 comments
-
"Router mode is experimental" | llama.cpp now has a router mode and I didn't know.
Posted by charmander_cha@reddit | LocalLLaMA | View on Reddit | 10 comments
-
Is there some kind of file with all the information from the ComfyUI documentation in Markdown?
Posted by charmander_cha@reddit | LocalLLaMA | View on Reddit | 10 comments
-
What are the limits of vibe coding?
Posted by charmander_cha@reddit | LocalLLaMA | View on Reddit | 19 comments
-
BitNet a4.8: 4-bit Activations for 1-bit LLMs (bitnet evolution)
Posted by charmander_cha@reddit | LocalLLaMA | View on Reddit | 5 comments
-
You can use two models at the same time to evaluate who responds better in Qwen Chat
Posted by charmander_cha@reddit | LocalLLaMA | View on Reddit | 4 comments
-
A little scene I created using Qwen's new chat
Posted by charmander_cha@reddit | LocalLLaMA | View on Reddit | 3 comments
-
I created a voice assistant that can open games for me (if you can run openai-whisper, you can run this)
Posted by charmander_cha@reddit | LocalLLaMA | View on Reddit | 2 comments
-
I created an entire project with Claude + ChatGPT + Qwen: Automated Python Project Documentation Generator: Your New Code Analysis Companion (AMA)
Posted by charmander_cha@reddit | LocalLLaMA | View on Reddit | 15 comments
-
The Era of 1.58-bit Large Language Models: A Breakthrough in Efficiency - Mindplex
Posted by charmander_cha@reddit | LocalLLaMA | View on Reddit | 15 comments