ninjasaid13
gemma-4-12b-it vs Qwen3.5-9B on shared benchmarks: Qwen is overall winner beating gemma in 5/8 benchmarks despite a smaller footprint
Posted by fulgencio_batista@reddit | LocalLLaMA | View on Reddit | 151 comments
Qwen cant wait to release 3.7 models
Posted by GotHereLateNameTaken@reddit | LocalLLaMA | View on Reddit | 276 comments
ninjasaid13@reddit
I Let a Small Model Train on Its Own Mistakes. It Reached 80% on HumanEval and Beat GPT-3.5 on Math
Posted by QuantumSeeds@reddit | LocalLLaMA | View on Reddit | 59 comments
ninjasaid13@reddit
<thinking></thinking>
Posted by Comfortable-Rock-498@reddit | LocalLLaMA | View on Reddit | 89 comments
ninjasaid13@reddit
Gemma 4 MTP released
Posted by rerri@reddit | LocalLLaMA | View on Reddit | 301 comments
ninjasaid13@reddit
<thinking></thinking>
Posted by Comfortable-Rock-498@reddit | LocalLLaMA | View on Reddit | 89 comments
ninjasaid13@reddit
Best Local LLMs - Apr 2026
Posted by rm-rf-rm@reddit | LocalLLaMA | View on Reddit | 365 comments
ninjasaid13@reddit
Embracing the noise: How to build an agent that is both neuro-symbolic and probabilistic.
Posted by DepthOk4115@reddit | LocalLLaMA | View on Reddit | 10 comments
ninjasaid13@reddit
Decreased Intelligence Density in DeepSeek V4 Pro
Posted by Mindless_Pain1860@reddit | LocalLLaMA | View on Reddit | 90 comments
ninjasaid13@reddit
r/LocalLLaMa Rule Updates
Posted by rm-rf-rm@reddit | LocalLLaMA | View on Reddit | 121 comments
ninjasaid13@reddit
I made a tiny world model game that runs locally on iPad
Posted by howthefrondsfold@reddit | LocalLLaMA | View on Reddit | 27 comments
ninjasaid13@reddit
Qwen-Image-2.0 is out - 7B unified gen+edit model with native 2K and actual text rendering
Posted by RIPT1D3_Z@reddit | LocalLLaMA | View on Reddit | 120 comments
ninjasaid13@reddit
Meta to open source versions of its next AI models
Posted by abkibaarnsit@reddit | LocalLLaMA | View on Reddit | 62 comments
ninjasaid13@reddit
Mistral AI to release Voxtral TTS, a 3-billion-parameter text-to-speech model with open weights that the company says outperformed ElevenLabs Flash v2.5 in human preference tests. The model runs on about 3 GB of RAM, achieves 90-millisecond time-to-first-audio, supports nine languages.
Posted by Nunki08@reddit | LocalLLaMA | View on Reddit | 186 comments
ninjasaid13@reddit
Introducing ARC-AGI-3
Posted by Complete-Sea6655@reddit | LocalLLaMA | View on Reddit | 100 comments
ninjasaid13@reddit
High school student seeking advice: Found an architectural breakthrough that scales a 17.6B model down to 417M?
Posted by Appropriate-Scar3116@reddit | LocalLLaMA | View on Reddit | 210 comments
ninjasaid13@reddit
Qwen3.5B VS the SOTA same size models from 2 years ago.
Posted by Uncle___Marty@reddit | LocalLLaMA | View on Reddit | 59 comments
ninjasaid13@reddit
Qwen 2.5 -> 3 -> 3.5, smallest models. Incredible improvement over the generations.
Posted by airbus_a360_when@reddit | LocalLLaMA | View on Reddit | 136 comments
ninjasaid13@reddit
Qwen3.5-397B-A17B-UD-TQ1 bench results FW Desktop Strix Halo 128GB
Posted by dabiggmoe2@reddit | LocalLLaMA | View on Reddit | 58 comments
ninjasaid13@reddit
meanwhile in China
Posted by Tiny_Judge_2119@reddit | LocalLLaMA | View on Reddit | 33 comments
ninjasaid13@reddit
How I mapped every High Court of Australia case and their citations (1901-2025)
Posted by Neon0asis@reddit | LocalLLaMA | View on Reddit | 6 comments
ninjasaid13@reddit
Pack it up guys, open weight AI models running offline locally on PCs aren't real. 😞
Posted by CesarOverlorde@reddit | LocalLLaMA | View on Reddit | 294 comments
ninjasaid13@reddit
Anthropic is deploying 20M$ to support AI regulation in sight of 2026 elections
Posted by 1998marcom@reddit | LocalLLaMA | View on Reddit | 81 comments
ninjasaid13@reddit
Qwen-Image-2.0 is out - 7B unified gen+edit model with native 2K and actual text rendering
Posted by RIPT1D3_Z@reddit | LocalLLaMA | View on Reddit | 120 comments
ninjasaid13@reddit
I'm playing telephone pictionary with LLMs, VLMs, SDs, and Kokoro on my Strix Halo
Posted by jfowers_amd@reddit | LocalLLaMA | View on Reddit | 9 comments
ninjasaid13@reddit
Unsloth just unleashed Glm 5! GGUF NOW!
Posted by RickyRickC137@reddit | LocalLLaMA | View on Reddit | 82 comments
ninjasaid13@reddit
New stealth model: Pony Alpha
Posted by sirjoaco@reddit | LocalLLaMA | View on Reddit | 30 comments
ninjasaid13@reddit
We built an 8B world model that beats 402B Llama 4 by generating web code instead of pixels — open weights on HF
Posted by jshin49@reddit | LocalLLaMA | View on Reddit | 46 comments
ninjasaid13@reddit
Fei Fei Li dropped a non-JEPA world model, and the spatial intelligence is insane
Posted by coloradical5280@reddit | LocalLLaMA | View on Reddit | 90 comments
ninjasaid13@reddit
GLM-Image is released!
Posted by foldl-li@reddit | LocalLLaMA | View on Reddit | 82 comments
ninjasaid13@reddit
The current state of sparse-MoE's for agentic coding work (Opinion)
Posted by ForsookComparison@reddit | LocalLLaMA | View on Reddit | 80 comments
ninjasaid13@reddit
Apple introduces SHARP, a model that generates a photorealistic 3D Gaussian representation from a single image in seconds.
Posted by themixtergames@reddit | LocalLLaMA | View on Reddit | 140 comments
ninjasaid13@reddit
Basketball AI with RF-DETR, SAM2, and SmolVLM2
Posted by RandomForests92@reddit | LocalLLaMA | View on Reddit | 48 comments
ninjasaid13@reddit
WTF! Is this real? Teenagers are building AGI Research Lab
Posted by Illustrious-Yak-9195@reddit | LocalLLaMA | View on Reddit | 16 comments
ninjasaid13@reddit
When you figure out it’s all just math:
Posted by Current-Ticket4214@reddit | LocalLLaMA | View on Reddit | 381 comments
ninjasaid13@reddit
Microsoft’s AI Scientist
Posted by Ok-Breakfast-4676@reddit | LocalLLaMA | View on Reddit | 36 comments