PaMRxR
Can't get over 250TPS on RTX5090 with Qwen3.5-4B
Posted by luckyj@reddit | LocalLLaMA | View on Reddit | 30 comments
Dual 3090 setup - performance optimization
Posted by PaMRxR@reddit | LocalLLaMA | View on Reddit | 45 comments
PaMRxR@reddit (OP)
I'm done with using local LLMs for coding
Posted by dtdisapointingresult@reddit | LocalLLaMA | View on Reddit | 810 comments
PaMRxR@reddit
Every time a new model comes out, the old one is obsolete of course
Posted by FullChampionship7564@reddit | LocalLLaMA | View on Reddit | 198 comments
PaMRxR@reddit
Qwen 3.6 35B crushes Gemma 4 26B on my tests
Posted by Lowkey_LokiSN@reddit | LocalLLaMA | View on Reddit | 116 comments
PaMRxR@reddit
Guys we have to change the pelican test
Posted by Tall-Ad-7742@reddit | LocalLLaMA | View on Reddit | 93 comments
PaMRxR@reddit
Updated Qwen3.5-9B Quantization Comparison
Posted by TitwitMuffbiscuit@reddit | LocalLLaMA | View on Reddit | 106 comments
PaMRxR@reddit
Speculative Decoding works great for Gemma 4 31B with E2B draft (+29% avg, +50% on code)
Posted by PerceptionGrouchy187@reddit | LocalLLaMA | View on Reddit | 117 comments
PaMRxR@reddit
Weekend project with Intel B70s
Posted by dev_is_active@reddit | LocalLLaMA | View on Reddit | 41 comments
PaMRxR@reddit
96GB Vram. What to run in 2026?
Posted by inthesearchof@reddit | LocalLLaMA | View on Reddit | 88 comments
PaMRxR@reddit
ASUS X99-E WS with 2x 3090. Anyone was able to set it up?
Posted by novanet-central@reddit | LocalLLaMA | View on Reddit | 6 comments
PaMRxR@reddit
Qwen3.5-35B-A3B Q4 Performance on Intel Arc B60?
Posted by LeDynamique@reddit | LocalLLaMA | View on Reddit | 5 comments
PaMRxR@reddit
Update on Qwen 3.5 35B A3B on Raspberry PI 5
Posted by jslominski@reddit | LocalLLaMA | View on Reddit | 37 comments
PaMRxR@reddit
Qwen3.5 2b, 4b and 9b tested on Raspberry Pi5
Posted by jslominski@reddit | LocalLLaMA | View on Reddit | 36 comments
PaMRxR@reddit
Qwen3.5-27B Q4 Quantization Comparison
Posted by TitwitMuffbiscuit@reddit | LocalLLaMA | View on Reddit | 116 comments
PaMRxR@reddit
What tokens/sec do you get when running Qwen 3.5 27B?
Posted by thegr8anand@reddit | LocalLLaMA | View on Reddit | 194 comments
PaMRxR@reddit
Open WebUI’s New Open Terminal + “Native” Tool Calling + Qwen3.5 35b = Holy Sh!t!!!
Posted by Porespellar@reddit | LocalLLaMA | View on Reddit | 221 comments
PaMRxR@reddit
PaMRxR@reddit
Qwen 3.5 Plus(397b-a17b) is now available on Chinese Qwen APP
Posted by AaronFeng47@reddit | LocalLLaMA | View on Reddit | 22 comments
PaMRxR@reddit
local vibe coding
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 146 comments
PaMRxR@reddit
Coding agent for local LLMs?
Posted by PaMRxR@reddit | LocalLLaMA | View on Reddit | 18 comments
PaMRxR@reddit (OP)
Qwen/Qwen3-Coder-Next · Hugging Face
Posted by coder543@reddit | LocalLLaMA | View on Reddit | 248 comments