charmander_cha

Offloading to SSD + speculative decoding via diffusion = real deal?

Posted by charmander_cha@reddit | LocalLLaMA | View on Reddit | 8 comments
The search problem has been solved - SemanticFileFinder (sff)

Posted by charmander_cha@reddit | LocalLLaMA | View on Reddit | 3 comments
Has anyone tried using TextGrad with Ollama? (TextGrad rewrites your prompt to make it more accurate, or something like that)

Posted by charmander_cha@reddit | LocalLLaMA | View on Reddit | 4 comments
"Router mode is experimental" | llama.cpp now has a router mode and I didn't know.

Posted by charmander_cha@reddit | LocalLLaMA | View on Reddit | 10 comments
Is there some kind of file with all the information from the ComfyUI documentation in markdown?

Posted by charmander_cha@reddit | LocalLLaMA | View on Reddit | 10 comments
What are the limits of vibe coding?

Posted by charmander_cha@reddit | LocalLLaMA | View on Reddit | 19 comments
BitNet a4.8: 4-bit Activations for 1-bit LLMs (bitnet evolution)

Posted by charmander_cha@reddit | LocalLLaMA | View on Reddit | 5 comments
You can use two models at the same time to evaluate who responds better in Qwen Chat

Posted by charmander_cha@reddit | LocalLLaMA | View on Reddit | 4 comments
A little scene I created using Qwen's new chat

Posted by charmander_cha@reddit | LocalLLaMA | View on Reddit | 3 comments
I created a voice assistant that can open games for me (if you can run openai-whisper, you can run this)

Posted by charmander_cha@reddit | LocalLLaMA | View on Reddit | 2 comments
I created an entire project with Claude + ChatGPT + Qwen: Automated Python Project Documentation Generator: Your New Code Analysis Companion (AMA)

Posted by charmander_cha@reddit | LocalLLaMA | View on Reddit | 15 comments
The Era of 1.58-bit Large Language Models: A Breakthrough in Efficiency - Mindplex

Posted by charmander_cha@reddit | LocalLLaMA | View on Reddit | 15 comments