anthonyg45157
How much VRAM needed for Qwen 3.6 27B Q8 with 262K context?
Posted by My_Unbiased_Opinion@reddit | LocalLLaMA | View on Reddit | 79 comments
Replaced Claude with local Qwen3.6-27B in my multi-agent orchestrator for 2 weeks
Posted by Interesting-Sock3940@reddit | LocalLLaMA | View on Reddit | 147 comments
anthonyg45157@reddit
Replaced Claude with local Qwen3.6-27B in my multi-agent orchestrator for 2 weeks
Posted by Interesting-Sock3940@reddit | LocalLLaMA | View on Reddit | 147 comments
anthonyg45157@reddit
Replaced Claude with local Qwen3.6-27B in my multi-agent orchestrator for 2 weeks
Posted by Interesting-Sock3940@reddit | LocalLLaMA | View on Reddit | 147 comments
anthonyg45157@reddit
Replaced Claude with local Qwen3.6-27B in my multi-agent orchestrator for 2 weeks
Posted by Interesting-Sock3940@reddit | LocalLLaMA | View on Reddit | 147 comments
anthonyg45157@reddit
Stop asking what model to run. There are literally only two.
Posted by Wrong_Mushroom_7350@reddit | LocalLLaMA | View on Reddit | 549 comments
anthonyg45157@reddit
Stop asking what model to run. There are literally only two.
Posted by Wrong_Mushroom_7350@reddit | LocalLLaMA | View on Reddit | 549 comments
anthonyg45157@reddit
llama: use f16 mask for FA to save VRAM by am17an Β· Pull Request #23764 Β· ggml-org/llama.cpp
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 78 comments
anthonyg45157@reddit
For everyone that uses OpenCode / Pi - Heres your promptprocessing fix!
Posted by No_Algae1753@reddit | LocalLLaMA | View on Reddit | 40 comments
anthonyg45157@reddit
Move to backend sampling for MTP draft path by gaugarg-nv Β· Pull Request #23287 Β· ggml-org/llama.cpp
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 37 comments
anthonyg45157@reddit
Thoughts on "production" model setups
Posted by fuse1921@reddit | LocalLLaMA | View on Reddit | 14 comments
anthonyg45157@reddit
Thoughts on "production" model setups
Posted by fuse1921@reddit | LocalLLaMA | View on Reddit | 14 comments
anthonyg45157@reddit
What llamacpp's webui has and what it lacks
Posted by gigachad_deluxe@reddit | LocalLLaMA | View on Reddit | 25 comments
anthonyg45157@reddit
What llamacpp's webui has and what it lacks
Posted by gigachad_deluxe@reddit | LocalLLaMA | View on Reddit | 25 comments
anthonyg45157@reddit
What llamacpp's webui has and what it lacks
Posted by gigachad_deluxe@reddit | LocalLLaMA | View on Reddit | 25 comments
anthonyg45157@reddit
Got MTP + TurboQuant running β Qwen3.6-27B -- 80+ t/s at 262K context on a single RTX 4090
Posted by indrasmirror@reddit | LocalLLaMA | View on Reddit | 76 comments
anthonyg45157@reddit
Got MTP + TurboQuant running β Qwen3.6-27B -- 80+ t/s at 262K context on a single RTX 4090
Posted by indrasmirror@reddit | LocalLLaMA | View on Reddit | 76 comments
anthonyg45157@reddit
Got MTP + TurboQuant running β Qwen3.6-27B -- 80+ t/s at 262K context on a single RTX 4090
Posted by indrasmirror@reddit | LocalLLaMA | View on Reddit | 76 comments
anthonyg45157@reddit
Got MTP + TurboQuant running β Qwen3.6-27B -- 80+ t/s at 262K context on a single RTX 4090
Posted by indrasmirror@reddit | LocalLLaMA | View on Reddit | 76 comments
anthonyg45157@reddit
AMA with Nous Research -- Ask Us Anything!
Posted by emozilla@reddit | LocalLLaMA | View on Reddit | 399 comments
anthonyg45157@reddit
AMA with Nous Research -- Ask Us Anything!
Posted by emozilla@reddit | LocalLLaMA | View on Reddit | 399 comments
anthonyg45157@reddit
AMA with Nous Research -- Ask Us Anything!
Posted by emozilla@reddit | LocalLLaMA | View on Reddit | 399 comments
anthonyg45157@reddit
AMA with Nous Research -- Ask Us Anything!
Posted by emozilla@reddit | LocalLLaMA | View on Reddit | 399 comments
anthonyg45157@reddit
Luce DFlash: Qwen3.6-27B at up to 2x throughput on a single RTX 3090
Posted by sandropuppo@reddit | LocalLLaMA | View on Reddit | 182 comments
anthonyg45157@reddit
An Overnight Stack for Qwen3.6β27B: 85 TPS, 125K Context, Vision β on One RTX 3090 | by Wasif Basharat | Apr, 2026
Posted by AmazingDrivers4u@reddit | LocalLLaMA | View on Reddit | 177 comments
anthonyg45157@reddit
Best config for Qwen3.6 27b / llama.cpp / opencode
Posted by Familiar_Wish1132@reddit | LocalLLaMA | View on Reddit | 110 comments
anthonyg45157@reddit
Best config for Qwen3.6 27b / llama.cpp / opencode
Posted by Familiar_Wish1132@reddit | LocalLLaMA | View on Reddit | 110 comments
anthonyg45157@reddit
Best config for Qwen3.6 27b / llama.cpp / opencode
Posted by Familiar_Wish1132@reddit | LocalLLaMA | View on Reddit | 110 comments
anthonyg45157@reddit
Best config for Qwen3.6 27b / llama.cpp / opencode
Posted by Familiar_Wish1132@reddit | LocalLLaMA | View on Reddit | 110 comments
anthonyg45157@reddit
Waiting Qwen3.6-27B I have no nails left...
Posted by DOAMOD@reddit | LocalLLaMA | View on Reddit | 95 comments
anthonyg45157@reddit
Qwen3.6 is incredible with OpenCode!
Posted by CountlessFlies@reddit | LocalLLaMA | View on Reddit | 166 comments
anthonyg45157@reddit
GPU advice for Qwen 3.5 27B / Gemma 4 31B (dense) β aiming for 64K ctx, 30+ t/s
Posted by Fit-Courage5400@reddit | LocalLLaMA | View on Reddit | 96 comments
anthonyg45157@reddit
Car-wash question and Qwen3.5-27b-Q6
Posted by KringleKrispi@reddit | LocalLLaMA | View on Reddit | 20 comments
anthonyg45157@reddit
If you haven't yet given Gemma 4 a go...do it today
Posted by No-Anchovies@reddit | LocalLLaMA | View on Reddit | 206 comments
anthonyg45157@reddit
Gemma 4 26B A4B is still fully capable at 245283/262144 (94%) contex !
Posted by cviperr33@reddit | LocalLLaMA | View on Reddit | 108 comments
anthonyg45157@reddit
Gemma 4 26B A4B is still fully capable at 245283/262144 (94%) contex !
Posted by cviperr33@reddit | LocalLLaMA | View on Reddit | 108 comments
anthonyg45157@reddit
Final voting results for Qwen 3.6
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 285 comments
anthonyg45157@reddit
Final voting results for Qwen 3.6
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 285 comments
anthonyg45157@reddit
Final voting results for Qwen 3.6
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 285 comments
anthonyg45157@reddit
Local Claude Code with Qwen3.5 27B
Posted by FeiX7@reddit | LocalLLaMA | View on Reddit | 122 comments
anthonyg45157@reddit
I donβt get it. Why would Facebook acquire Moltbook? Are their engineers too busy recording a day in the life of a meta engineer and cannot build it in a week or so?!
Posted by SilverRegion9394@reddit | LocalLLaMA | View on Reddit | 84 comments
anthonyg45157@reddit
We collected 135 phrases Whisper hallucinates during silence β here's what it says when nobody's talking and how we stopped it
Posted by Aggravating-Gap7783@reddit | LocalLLaMA | View on Reddit | 95 comments
anthonyg45157@reddit
We collected 135 phrases Whisper hallucinates during silence β here's what it says when nobody's talking and how we stopped it
Posted by Aggravating-Gap7783@reddit | LocalLLaMA | View on Reddit | 95 comments
anthonyg45157@reddit
PSA: Humans are scary stupid
Posted by rm-rf-rm@reddit | LocalLLaMA | View on Reddit | 204 comments
anthonyg45157@reddit
why is openclaw even this popular?
Posted by Crazyscientist1024@reddit | LocalLLaMA | View on Reddit | 320 comments
anthonyg45157@reddit
why is openclaw even this popular?
Posted by Crazyscientist1024@reddit | LocalLLaMA | View on Reddit | 320 comments
anthonyg45157@reddit
why is openclaw even this popular?
Posted by Crazyscientist1024@reddit | LocalLLaMA | View on Reddit | 320 comments
anthonyg45157@reddit
why is openclaw even this popular?
Posted by Crazyscientist1024@reddit | LocalLLaMA | View on Reddit | 320 comments
anthonyg45157@reddit
why is openclaw even this popular?
Posted by Crazyscientist1024@reddit | LocalLLaMA | View on Reddit | 320 comments
anthonyg45157@reddit
LM Link
Posted by Blindax@reddit | LocalLLaMA | View on Reddit | 40 comments