nickless07
gemma-4-12b-it vs Qwen3.5-9B on shared benchmarks: Qwen is overall winner beating gemma in 5/8 benchmarks despite a smaller footprint
Posted by fulgencio_batista@reddit | LocalLLaMA | View on Reddit | 116 comments
google/gemma-4-12B · Hugging Face
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 276 comments
nickless07@reddit
google/gemma-4-12B · Hugging Face
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 276 comments
nickless07@reddit
google/gemma-4-12B · Hugging Face
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 276 comments
nickless07@reddit
google/gemma-4-12B · Hugging Face
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 276 comments
nickless07@reddit
Stop asking what model to run. There are literally only two.
Posted by Wrong_Mushroom_7350@reddit | LocalLLaMA | View on Reddit | 569 comments
nickless07@reddit
Misunderstanding memory usage - 11.68gb quantized model takes up 22gb of RAM?
Posted by NotARedditUser3@reddit | LocalLLaMA | View on Reddit | 17 comments
nickless07@reddit
Misunderstanding memory usage - 11.68gb quantized model takes up 22gb of RAM?
Posted by NotARedditUser3@reddit | LocalLLaMA | View on Reddit | 17 comments
nickless07@reddit
Misunderstanding memory usage - 11.68gb quantized model takes up 22gb of RAM?
Posted by NotARedditUser3@reddit | LocalLLaMA | View on Reddit | 17 comments
nickless07@reddit
How do I improve my T/S
Posted by KneelB4S8n@reddit | LocalLLaMA | View on Reddit | 9 comments
nickless07@reddit
For those creating personal assistants locally - how has short/long term memory impacted your experience?
Posted by GrungeWerX@reddit | LocalLLaMA | View on Reddit | 50 comments
nickless07@reddit
Qwen3.6-35B-A3B-APEX / 128K ctx on RTX 3060 12GB — 37 t/s gen with 72k ctx filled, PPL 3.25, offloading 17GB model
Posted by old-mike@reddit | LocalLLaMA | View on Reddit | 41 comments
nickless07@reddit
Qwen3.6-35B-A3B-APEX / 128K ctx on RTX 3060 12GB — 37 t/s gen with 72k ctx filled, PPL 3.25, offloading 17GB model
Posted by old-mike@reddit | LocalLLaMA | View on Reddit | 41 comments
nickless07@reddit
Is there any case of a less quantised smaller model outperforming a more quantised larger model?
Posted by opoot_@reddit | LocalLLaMA | View on Reddit | 25 comments
nickless07@reddit
Is there any case of a less quantised smaller model outperforming a more quantised larger model?
Posted by opoot_@reddit | LocalLLaMA | View on Reddit | 25 comments
nickless07@reddit
Are GPU prices hitting peak and falling?
Posted by DistanceSolar1449@reddit | LocalLLaMA | View on Reddit | 39 comments
nickless07@reddit
Are GPU prices hitting peak and falling?
Posted by DistanceSolar1449@reddit | LocalLLaMA | View on Reddit | 39 comments
nickless07@reddit
qwen3.6-35b-a3b-mtp running on GTX 1060 6GB
Posted by xxvegas@reddit | LocalLLaMA | View on Reddit | 11 comments
nickless07@reddit
qwen3.6-35b-a3b-mtp running on GTX 1060 6GB
Posted by xxvegas@reddit | LocalLLaMA | View on Reddit | 11 comments
nickless07@reddit
qwen3.6-35b-a3b-mtp running on GTX 1060 6GB
Posted by xxvegas@reddit | LocalLLaMA | View on Reddit | 11 comments
nickless07@reddit
Best solution to generate reports locally with graphs, charts? Beginner question.
Posted by NetZeroSun@reddit | LocalLLaMA | View on Reddit | 12 comments
nickless07@reddit
Seeing the activity pop up big time in this sub due to various open models. Most of them require at least 16gb vram. What can I do with 8?
Posted by baked_tea@reddit | LocalLLaMA | View on Reddit | 13 comments
nickless07@reddit
Qwen3.6 35b-a3b 🤯
Posted by EffectiveMedium2683@reddit | LocalLLaMA | View on Reddit | 118 comments
nickless07@reddit
Qwen3.6 35b-a3b 🤯
Posted by EffectiveMedium2683@reddit | LocalLLaMA | View on Reddit | 118 comments
nickless07@reddit
Qwen3.6 35b-a3b 🤯
Posted by EffectiveMedium2683@reddit | LocalLLaMA | View on Reddit | 118 comments
nickless07@reddit
TextGen is now a native desktop app. Open-source alternative to LM Studio (formerly text-generation-webui).
Posted by oobabooga4@reddit | LocalLLaMA | View on Reddit | 221 comments
nickless07@reddit
LM Studio - 3 GPUs, one model per GPU as different servers
Posted by MarcusAurelius68@reddit | LocalLLaMA | View on Reddit | 15 comments
nickless07@reddit
LM Studio - 3 GPUs, one model per GPU as different servers
Posted by MarcusAurelius68@reddit | LocalLLaMA | View on Reddit | 15 comments
nickless07@reddit
Qwen3.6 35b-a3b 🤯
Posted by EffectiveMedium2683@reddit | LocalLLaMA | View on Reddit | 118 comments
nickless07@reddit
What's the current best small model?
Posted by Conscious_Nobody9571@reddit | LocalLLaMA | View on Reddit | 52 comments
nickless07@reddit
Does 'preserve_thinking' work with openwebui?
Posted by sterby92@reddit | LocalLLaMA | View on Reddit | 33 comments
nickless07@reddit
Does 'preserve_thinking' work with openwebui?
Posted by sterby92@reddit | LocalLLaMA | View on Reddit | 33 comments
nickless07@reddit
Does 'preserve_thinking' work with openwebui?
Posted by sterby92@reddit | LocalLLaMA | View on Reddit | 33 comments
nickless07@reddit
Does 'preserve_thinking' work with openwebui?
Posted by sterby92@reddit | LocalLLaMA | View on Reddit | 33 comments
nickless07@reddit
What's the current best small model?
Posted by Conscious_Nobody9571@reddit | LocalLLaMA | View on Reddit | 52 comments
nickless07@reddit
Has anyone set a local LLM up as a language learning tool?
Posted by OrdoRidiculous@reddit | LocalLLaMA | View on Reddit | 24 comments
nickless07@reddit
Tools in Openwebui
Posted by Radiant-Giraffe5159@reddit | LocalLLaMA | View on Reddit | 14 comments
nickless07@reddit
Tools in Openwebui
Posted by Radiant-Giraffe5159@reddit | LocalLLaMA | View on Reddit | 14 comments
nickless07@reddit
Tools in Openwebui
Posted by Radiant-Giraffe5159@reddit | LocalLLaMA | View on Reddit | 14 comments
nickless07@reddit
How do you estimate total memory usage?
Posted by HornyGooner4402@reddit | LocalLLaMA | View on Reddit | 16 comments
nickless07@reddit
How to change settings on llmster server?
Posted by FalconX88@reddit | LocalLLaMA | View on Reddit | 5 comments
nickless07@reddit
How to change settings on llmster server?
Posted by FalconX88@reddit | LocalLLaMA | View on Reddit | 5 comments
nickless07@reddit
How to change settings on llmster server?
Posted by FalconX88@reddit | LocalLLaMA | View on Reddit | 5 comments
nickless07@reddit
Have Qwen said anything about further Qwen 3.6 models?
Posted by spaceman_@reddit | LocalLLaMA | View on Reddit | 61 comments
nickless07@reddit
Anyone tried 2 different GPUs in one PC for local LLMs?
Posted by ShadowBannedAugustus@reddit | LocalLLaMA | View on Reddit | 21 comments
nickless07@reddit
Open Models - April 2026 - One of the best months of all time for Local LLMs?
Posted by pmttyji@reddit | LocalLLaMA | View on Reddit | 153 comments
nickless07@reddit
Sorry if it's not the best place to ask this, of the models in the image, which is the best for (problem solving)/Coding and the best one for studying (ask LLM concepts) ? My PC build is RX 9060 XT 16GB + I3 12100F + 16 GB DDR4 + llama.cpp with Vulkan backend + Linux Mint.
Posted by Badhunter31415@reddit | LocalLLaMA | View on Reddit | 13 comments
nickless07@reddit
If the AI bubble pops, will GPU prices increase or decrease?
Posted by Mashic@reddit | LocalLLaMA | View on Reddit | 39 comments
nickless07@reddit
Best Adventure Gaming Setup
Posted by thefool00@reddit | LocalLLaMA | View on Reddit | 11 comments
nickless07@reddit
MLX's gone today in newest LM Studio
Posted by maciejb84@reddit | LocalLLaMA | View on Reddit | 3 comments