charmander_cha
Microsoft Aion 1.0 Instruct and Aion 1.0 Plan models!
Posted by Mysterious_Finish543@reddit | LocalLLaMA | View on Reddit | 109 comments
What's the status of non-CUDA inference?
Posted by IngwiePhoenix@reddit | LocalLLaMA | View on Reddit | 30 comments
charmander_cha@reddit
StepFun 3.7 Flash
Posted by Everlier@reddit | LocalLLaMA | View on Reddit | 151 comments
charmander_cha@reddit
Looks like Miminax-M3 is just around the corner
Posted by OnkelBB@reddit | LocalLLaMA | View on Reddit | 40 comments
charmander_cha@reddit
[NEW] Supra-50M Released!
Posted by Dangerous_Try3619@reddit | LocalLLaMA | View on Reddit | 60 comments
charmander_cha@reddit
Waiting on Qwen to drop those 3.7 models be like:
Posted by Porespellar@reddit | LocalLLaMA | View on Reddit | 47 comments
charmander_cha@reddit
Carbon: Decoding the Language of Life
Posted by loubnabnl@reddit | LocalLLaMA | View on Reddit | 49 comments
charmander_cha@reddit
Rewrite Bun in Rust has been merged
Posted by gruenistblau@reddit | programming | View on Reddit | 412 comments
charmander_cha@reddit
GrapheneOS: Google's Play Integrity API requires hardware attestation ... Apple already has it as a requirement. Over the long term, this will increasingly lock out hardware and OS competition.
Posted by TheTwelveYearOld@reddit | linux | View on Reddit | 245 comments
charmander_cha@reddit
Any news (or hope) of Qwen-3.6 14B and 9B distills for local coding ?
Posted by QuchchenEbrithin2day@reddit | LocalLLaMA | View on Reddit | 54 comments
charmander_cha@reddit
Any news (or hope) of Qwen-3.6 14B and 9B distills for local coding ?
Posted by QuchchenEbrithin2day@reddit | LocalLLaMA | View on Reddit | 54 comments
charmander_cha@reddit
You can now read Gemma 3's mind
Posted by DigiDecode_@reddit | LocalLLaMA | View on Reddit | 20 comments
charmander_cha@reddit
HOT TAKE: local models + agent harnesses are now capable enough to hand off junior-level IT professional tasks to [human written]
Posted by Porespellar@reddit | LocalLLaMA | View on Reddit | 67 comments
charmander_cha@reddit
Introducing SubQ: The First Fully Subquadratic LLM
Posted by hltt@reddit | LocalLLaMA | View on Reddit | 6 comments
charmander_cha@reddit
Bun is being rewritten to Rust
Posted by aabbdev@reddit | programming | View on Reddit | 152 comments
charmander_cha@reddit
Open source models are going to be the future on Cursor, OpenCode etc.
Posted by _maverick98@reddit | LocalLLaMA | View on Reddit | 148 comments
charmander_cha@reddit
Qwen 3.6 seems to have a lot of trouble with tool calling
Posted by Perfect-Campaign9551@reddit | LocalLLaMA | View on Reddit | 58 comments
charmander_cha@reddit
Qwen Models are such good models?
Posted by FeiX7@reddit | LocalLLaMA | View on Reddit | 27 comments
charmander_cha@reddit
Meta’s $2 billion Manus acquisition blocked by China.
Posted by Nunki08@reddit | LocalLLaMA | View on Reddit | 97 comments
charmander_cha@reddit
Meta’s $2 billion Manus acquisition blocked by China.
Posted by Nunki08@reddit | LocalLLaMA | View on Reddit | 97 comments
charmander_cha@reddit
Opencode-power-pack – Claude Code skills ported to OpenCode
Posted by waybarrios@reddit | LocalLLaMA | View on Reddit | 8 comments
charmander_cha@reddit
Qwen 3.6 27B Makes Huge Gains in Agency on Artificial Analysis - Ties with Sonnet 4.6
Posted by dionysio211@reddit | LocalLLaMA | View on Reddit | 177 comments
charmander_cha@reddit
Qwen3 TTS is seriously underrated - I got it running locally in real-time and it's one of the most expressive open TTS models I've tried
Posted by fagenorn@reddit | LocalLLaMA | View on Reddit | 117 comments
charmander_cha@reddit
US gov memo on “adversarial distillation” - are we heading toward tighter controls on open models?
Posted by MLExpert000@reddit | LocalLLaMA | View on Reddit | 399 comments
charmander_cha@reddit
Qwen-3.6-27B, llamacpp, speculative decoding - appreciation post
Posted by Then-Topic8766@reddit | LocalLLaMA | View on Reddit | 95 comments
charmander_cha@reddit
Qwen3 TTS is seriously underrated - I got it running locally in real-time and it's one of the most expressive open TTS models I've tried
Posted by fagenorn@reddit | LocalLLaMA | View on Reddit | 117 comments
charmander_cha@reddit
I don’t believe this benchmark 27b size model next opus 4.5! Anyone can confirm testing with real agentic workflow?
Posted by Wonderful-Ad-5952@reddit | LocalLLaMA | View on Reddit | 50 comments
charmander_cha@reddit
MIT & the IMO released MathNet, the world’s largest dataset of International Math Olympiad problems & solutions. MathNet is 5x larger than previous datasets & is sourced from over 40 countries across 4 decades
Posted by Nunki08@reddit | LocalLLaMA | View on Reddit | 5 comments
charmander_cha@reddit
Qwen 3.6 27B is out
Posted by NoConcert8847@reddit | LocalLLaMA | View on Reddit | 609 comments
charmander_cha@reddit
llama.cpp is the linux of llm
Posted by DevelopmentBorn3978@reddit | LocalLLaMA | View on Reddit | 96 comments
charmander_cha@reddit
Why doesn't any OSS tool treat llama.cpp as a first class citizen?
Posted by rm-rf-rm@reddit | LocalLLaMA | View on Reddit | 120 comments
charmander_cha@reddit
When did Github stop being about Git?
Posted by dgkimpton@reddit | programming | View on Reddit | 134 comments
charmander_cha@reddit
Qwen3.6 (35B-A3B) with OpenCode. Running locally with llama.cpp
Posted by curiousily_@reddit | LocalLLaMA | View on Reddit | 5 comments
charmander_cha@reddit
Are you guys actually using local tool calling or is it a collective prank?
Posted by Mayion@reddit | LocalLLaMA | View on Reddit | 197 comments
charmander_cha@reddit
When is Qwen 3.6 27B dropping? Didn’t it win the vote?
Posted by GrungeWerX@reddit | LocalLLaMA | View on Reddit | 72 comments
charmander_cha@reddit
Don't ask Qwen 3.6 35b to give you aski image of Yoshi :)
Posted by anzzax@reddit | LocalLLaMA | View on Reddit | 19 comments
charmander_cha@reddit
Qwen3.6. This is it.
Posted by Local-Cardiologist-5@reddit | LocalLLaMA | View on Reddit | 420 comments
charmander_cha@reddit
Bonsai models are pure hype: Bonsai-8B is MUCH dumber than Gemma-4-E2B
Posted by WeGoToMars7@reddit | LocalLLaMA | View on Reddit | 71 comments
charmander_cha@reddit
Ternary Bonsai: Top intelligence at 1.58 bits
Posted by pmttyji@reddit | LocalLLaMA | View on Reddit | 89 comments
charmander_cha@reddit
Ternary Bonsai: Top intelligence at 1.58 bits
Posted by pmttyji@reddit | LocalLLaMA | View on Reddit | 89 comments
charmander_cha@reddit
Ternary Bonsai: Top intelligence at 1.58 bits
Posted by pmttyji@reddit | LocalLLaMA | View on Reddit | 89 comments
charmander_cha@reddit
Comparison Qwen 3.6 35B MoE vs Qwen 3.5 35B MoE on Research Paper to WebApp
Posted by dreamai87@reddit | LocalLLaMA | View on Reddit | 29 comments
charmander_cha@reddit
Its just a new Qwen model
Posted by StandardLovers@reddit | LocalLLaMA | View on Reddit | 21 comments
charmander_cha@reddit
VAD issues - takes too much time to understand when the user has stopped talking
Posted by Male_Cat_@reddit | LocalLLaMA | View on Reddit | 9 comments
charmander_cha@reddit
(llama.cpp) Possible to disable reasoning for some requests (while leaving reasoning on by default)?
Posted by regunakyle@reddit | LocalLLaMA | View on Reddit | 21 comments
charmander_cha@reddit
(llama.cpp) Possible to disable reasoning for some requests (while leaving reasoning on by default)?
Posted by regunakyle@reddit | LocalLLaMA | View on Reddit | 21 comments
charmander_cha@reddit
BlueTTS is basically supertonic look at the paper and the code
Posted by Elegant-Condition206@reddit | LocalLLaMA | View on Reddit | 6 comments
charmander_cha@reddit
huge improvement after moving from ollama to llama.cpp
Posted by leonardosalvatore@reddit | LocalLLaMA | View on Reddit | 76 comments
charmander_cha@reddit
It was fun while it lasted...
Posted by Specter_Origin@reddit | LocalLLaMA | View on Reddit | 15 comments
charmander_cha@reddit
It was fun while it lasted...
Posted by Specter_Origin@reddit | LocalLLaMA | View on Reddit | 15 comments