FluoroquinolonesKill
google/gemma-4-12B · Hugging Face
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 285 comments
Gemma 4 MTP released
Posted by rerri@reddit | LocalLLaMA | View on Reddit | 301 comments
FluoroquinolonesKill@reddit
Local AI is the best
Posted by fake_agent_smith@reddit | LocalLLaMA | View on Reddit | 60 comments
FluoroquinolonesKill@reddit
If it works - don’t touch it: COMPETITION
Posted by awfulalexey@reddit | LocalLLaMA | View on Reddit | 112 comments
FluoroquinolonesKill@reddit
Gemma 4 - lazy model or am I crazy? (bit of a rant)
Posted by Pyrenaeda@reddit | LocalLLaMA | View on Reddit | 151 comments
FluoroquinolonesKill@reddit
Gemma 4 31B vs Qwen 3.5 27B: Which is best for long context worklows? My THOUGHTS...
Posted by GrungeWerX@reddit | LocalLLaMA | View on Reddit | 174 comments
FluoroquinolonesKill@reddit
More Gemma4 fixes in the past 24 hours
Posted by andy2na@reddit | LocalLLaMA | View on Reddit | 120 comments
FluoroquinolonesKill@reddit
Gemma 4 on Llama.cpp should be stable now
Posted by ilintar@reddit | LocalLLaMA | View on Reddit | 167 comments
FluoroquinolonesKill@reddit
I think my Gemma4 is having a breakdown
Posted by MrSilencerbob@reddit | LocalLLaMA | View on Reddit | 20 comments
FluoroquinolonesKill@reddit
It looks like we’ll need to download the new Gemma 4 GGUFs
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 147 comments
FluoroquinolonesKill@reddit
so…. Qwen3.5 or Gemma 4?
Posted by MLExpert000@reddit | LocalLLaMA | View on Reddit | 121 comments
FluoroquinolonesKill@reddit
Gemma 4 has been abliterated
Posted by coder3101@reddit | LocalLLaMA | View on Reddit | 26 comments
FluoroquinolonesKill@reddit
Gemma 4 has been released
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 702 comments
FluoroquinolonesKill@reddit
Gemma 4 has been released
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 702 comments
FluoroquinolonesKill@reddit
#OpenSource4o Movement Trending on Twitter/X - Release Opensource of GPT-4o
Posted by pmttyji@reddit | LocalLLaMA | View on Reddit | 183 comments
FluoroquinolonesKill@reddit
I fine-tuned Qwen3.5-27B with 35k examples into an AI companion - after 2,000 conversations here’s what actually matters for personality
Posted by Crypto_Stoozy@reddit | LocalLLaMA | View on Reddit | 59 comments
FluoroquinolonesKill@reddit
Heretic has FINALLY defeated GPT-OSS with a new experimental decensoring method called ARA
Posted by pigeon57434@reddit | LocalLLaMA | View on Reddit | 152 comments
FluoroquinolonesKill@reddit
Heretic has FINALLY defeated GPT-OSS with a new experimental decensoring method called ARA
Posted by pigeon57434@reddit | LocalLLaMA | View on Reddit | 152 comments
FluoroquinolonesKill@reddit
webui: Agentic Loop + MCP Client with support for Tools, Resources and Prompts has been merged into llama.cpp
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 59 comments
FluoroquinolonesKill@reddit
webui: Agentic Loop + MCP Client with support for Tools, Resources and Prompts has been merged into llama.cpp
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 59 comments
FluoroquinolonesKill@reddit
webui: Agentic Loop + MCP Client with support for Tools, Resources and Prompts has been merged into llama.cpp
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 59 comments
FluoroquinolonesKill@reddit
Qwen3.5 35b UD Q4 K XL Prior to 3/5 worked great, now not so much...
Posted by thejacer@reddit | LocalLLaMA | View on Reddit | 19 comments
FluoroquinolonesKill@reddit
webui: Agentic Loop + MCP Client with support for Tools, Resources and Prompts has been merged into llama.cpp
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 59 comments
FluoroquinolonesKill@reddit
Qwen3.5 "Low Reasoning Effort" trick in llama-server
Posted by coder543@reddit | LocalLLaMA | View on Reddit | 22 comments
FluoroquinolonesKill@reddit
Qwen 3.5 27-35-122B - Jinja Template Modification (Based on Bartowski's Jinja) - No thinking by default - straight quick answers, need thinking? simple activation with "/think" command anywhere in the system prompt.
Posted by -Ellary-@reddit | LocalLLaMA | View on Reddit | 26 comments
FluoroquinolonesKill@reddit
Qwen 3.5 Jinja Template – Restores Qwen /no_thinking behavior!
Posted by Substantial_Swan_144@reddit | LocalLLaMA | View on Reddit | 14 comments
FluoroquinolonesKill@reddit
You can use Qwen3.5 without thinking
Posted by guiopen@reddit | LocalLLaMA | View on Reddit | 86 comments
FluoroquinolonesKill@reddit
Nemo 30B is insane. 1M+ token CTX on one 3090
Posted by Dismal-Effect-1914@reddit | LocalLLaMA | View on Reddit | 112 comments
FluoroquinolonesKill@reddit
ACE-Step-1.5 has just been released. It’s an MIT-licensed open source audio generative model with performance close to commercial platforms like Suno
Posted by iGermanProd@reddit | LocalLLaMA | View on Reddit | 138 comments
FluoroquinolonesKill@reddit
GLM 4.7 Flash: Huge performance improvement with -kvu
Posted by TokenRingAI@reddit | LocalLLaMA | View on Reddit | 72 comments
FluoroquinolonesKill@reddit
KV cache fix for GLM 4.7 Flash
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 73 comments
FluoroquinolonesKill@reddit
Quiet Threadripper AI Workstation - 768GB DDR5 and 160GB VRAM (RTX 5090 + 4x R9700)
Posted by sloptimizer@reddit | LocalLLaMA | View on Reddit | 99 comments
FluoroquinolonesKill@reddit
GLM 4.7 Flash official support merged in llama.cpp
Posted by ayylmaonade@reddit | LocalLLaMA | View on Reddit | 64 comments
FluoroquinolonesKill@reddit
GLM 4.7 Flash official support merged in llama.cpp
Posted by ayylmaonade@reddit | LocalLLaMA | View on Reddit | 64 comments
FluoroquinolonesKill@reddit
If you dont think Ai is an emergency you are about to have issues...
Posted by CannyGardener@reddit | preppers | View on Reddit | 813 comments
FluoroquinolonesKill@reddit
Mistral Small Creative!?
Posted by LoveMind_AI@reddit | LocalLLaMA | View on Reddit | 22 comments
FluoroquinolonesKill@reddit
Mistral Small Creative!?
Posted by LoveMind_AI@reddit | LocalLLaMA | View on Reddit | 22 comments
FluoroquinolonesKill@reddit
Mistral Small Creative!?
Posted by LoveMind_AI@reddit | LocalLLaMA | View on Reddit | 22 comments
FluoroquinolonesKill@reddit
My little decentralized Locallama setup, 216gb VRAM
Posted by Goldkoron@reddit | LocalLLaMA | View on Reddit | 154 comments
FluoroquinolonesKill@reddit
Mistral 3 14b against the competition ?
Posted by EffectiveGlove1651@reddit | LocalLLaMA | View on Reddit | 25 comments
FluoroquinolonesKill@reddit
My experiences with the new Ministral 3 14B Reasoning 2512 Q8
Posted by egomarker@reddit | LocalLLaMA | View on Reddit | 106 comments
FluoroquinolonesKill@reddit
My experiences with the new Ministral 3 14B Reasoning 2512 Q8
Posted by egomarker@reddit | LocalLLaMA | View on Reddit | 106 comments
FluoroquinolonesKill@reddit
My experiences with the new Ministral 3 14B Reasoning 2512 Q8
Posted by egomarker@reddit | LocalLLaMA | View on Reddit | 106 comments
FluoroquinolonesKill@reddit
My experiences with the new Ministral 3 14B Reasoning 2512 Q8
Posted by egomarker@reddit | LocalLLaMA | View on Reddit | 106 comments
FluoroquinolonesKill@reddit
Mistral 3 Blog post
Posted by rerri@reddit | LocalLLaMA | View on Reddit | 173 comments
FluoroquinolonesKill@reddit
Ministral-3 has been released
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 61 comments
FluoroquinolonesKill@reddit
I have a RTX5090 and an AMD AI MAX+ 95 128GB. Which benchmark do you want me to run?
Posted by foogitiff@reddit | LocalLLaMA | View on Reddit | 36 comments
FluoroquinolonesKill@reddit
I have a RTX5090 and an AMD AI MAX+ 95 128GB. Which benchmark do you want me to run?
Posted by foogitiff@reddit | LocalLLaMA | View on Reddit | 36 comments
FluoroquinolonesKill@reddit
Budget Hardware Recommendations (1.3k)
Posted by xxxmralbinoxxx@reddit | LocalLLaMA | View on Reddit | 5 comments
FluoroquinolonesKill@reddit
if open-webui is trash, whats the next best thing available to use?
Posted by Tricky_Reflection_75@reddit | LocalLLaMA | View on Reddit | 173 comments