mivog49274
i dedicate this meme to you r/LocalLLaMA
Posted by LPFchan@reddit | LocalLLaMA | View on Reddit | 44 comments
Breaking the music supply constraint
Posted by entsnack@reddit | LocalLLaMA | View on Reddit | 317 comments
mivog49274@reddit
Next year we're getting 0.5T model from Grok
Posted by pmttyji@reddit | LocalLLaMA | View on Reddit | 200 comments
mivog49274@reddit
meituan-longcat/LongCat-Video-Avatar-1.5 路 Hugging Face
Posted by pmttyji@reddit | LocalLLaMA | View on Reddit | 15 comments
mivog49274@reddit
DeepSeek is pushing forward with $10.29 billion financing round, with Liang Wenfeng committing to continue developing open-source AI models rather than pursuing short-term commercialization goals
Posted by External_Mood4719@reddit | LocalLLaMA | View on Reddit | 119 comments
mivog49274@reddit
Qwen 3.7 droped on Qwen Chat
Posted by Foxiya@reddit | LocalLLaMA | View on Reddit | 221 comments
mivog49274@reddit
Qwen 3.7 droped on Qwen Chat
Posted by Foxiya@reddit | LocalLLaMA | View on Reddit | 221 comments
mivog49274@reddit
Qwen 3.7 droped on Qwen Chat
Posted by Foxiya@reddit | LocalLLaMA | View on Reddit | 221 comments
mivog49274@reddit
Will there be any more Qwen3.6 series models?
Posted by cafedude@reddit | LocalLLaMA | View on Reddit | 102 comments
mivog49274@reddit
New "major breakthrough?" architecture SubQ
Posted by Daemontatox@reddit | LocalLLaMA | View on Reddit | 37 comments
mivog49274@reddit
Qwen/SAE-Res-Qwen3.5-27B-W80K-L0_100 路 Hugging Face
Posted by FaustAg@reddit | LocalLLaMA | View on Reddit | 15 comments
mivog49274@reddit
Decreased Intelligence Density in DeepSeek V4 Pro
Posted by Mindless_Pain1860@reddit | LocalLLaMA | View on Reddit | 90 comments
mivog49274@reddit
Qwen 3.6 Max Preview just went live on the Qwen Chat website. It currently has the highest AA-Intelligence Index score among Chinese models (52) (Will it be open source?)
Posted by Nunki08@reddit | LocalLLaMA | View on Reddit | 92 comments
mivog49274@reddit
LLM Neuroanatomy III - LLMs seem to think in geometry, not language
Posted by Reddactor@reddit | LocalLLaMA | View on Reddit | 100 comments
mivog49274@reddit
[Model Release] I trained a 9B model to be agentic Data Analyst (Qwen3.5-9B + LoRA). Base model failed 100%, this LoRA completes 89% of workflows without human intervention.
Posted by Awkward_Run_9982@reddit | LocalLLaMA | View on Reddit | 45 comments
mivog49274@reddit
One year later: this question feels a lot less crazy
Posted by gamblingapocalypse@reddit | LocalLLaMA | View on Reddit | 48 comments
mivog49274@reddit
We aren鈥檛 even close to AGI
Posted by CrimsonShikabane@reddit | LocalLLaMA | View on Reddit | 314 comments
mivog49274@reddit
Gemma
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 32 comments
mivog49274@reddit
Gemma
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 32 comments
mivog49274@reddit
Analyzing Claude Code Source Code. Write "WTF" and Anthropic knows.
Posted by QuantumSeeds@reddit | LocalLLaMA | View on Reddit | 170 comments
mivog49274@reddit
Talking with the people that spam their AI slop is actually really fun!
Posted by EffectiveCeilingFan@reddit | LocalLLaMA | View on Reddit | 42 comments
mivog49274@reddit
Talking with the people that spam their AI slop is actually really fun!
Posted by EffectiveCeilingFan@reddit | LocalLLaMA | View on Reddit | 42 comments
mivog49274@reddit
OpenCode concerns (not truely local)
Posted by Ueberlord@reddit | LocalLLaMA | View on Reddit | 185 comments
mivog49274@reddit
OpenCode concerns (not truely local)
Posted by Ueberlord@reddit | LocalLLaMA | View on Reddit | 185 comments
mivog49274@reddit
What is Hunter Alpha?
Posted by MrMrsPotts@reddit | LocalLLaMA | View on Reddit | 144 comments
mivog49274@reddit
New benchmark just dropped.
Posted by ConfidentDinner6648@reddit | LocalLLaMA | View on Reddit | 140 comments
mivog49274@reddit
More quantization visualization types (repost)
Posted by copingmechanism@reddit | LocalLLaMA | View on Reddit | 51 comments
mivog49274@reddit
More quantization visualization types (repost)
Posted by copingmechanism@reddit | LocalLLaMA | View on Reddit | 51 comments
mivog49274@reddit
GLM-5 is officially on NVIDIA NIM, and you can now use it to power Claude Code for FREE 馃殌
Posted by PreparationAny8816@reddit | LocalLLaMA | View on Reddit | 40 comments
mivog49274@reddit
GLM-5 is officially on NVIDIA NIM, and you can now use it to power Claude Code for FREE 馃殌
Posted by PreparationAny8816@reddit | LocalLLaMA | View on Reddit | 40 comments
mivog49274@reddit
Nemotron-3-nano:30b is a spectacular general purpose local LLM
Posted by DrewGrgich@reddit | LocalLLaMA | View on Reddit | 133 comments
mivog49274@reddit
ISRM: Infinitely Scalable Recursive Model
Posted by Available-Craft-5795@reddit | LocalLLaMA | View on Reddit | 12 comments
mivog49274@reddit
GLM 4.7 is out on HF!
Posted by KvAk_AKPlaysYT@reddit | LocalLLaMA | View on Reddit | 131 comments
mivog49274@reddit
Is Grokipedia available for fine-tuning?
Posted by Chance-Studio-8242@reddit | LocalLLaMA | View on Reddit | 30 comments
mivog49274@reddit
Celebrating 1 year anniversary of the revolutionary game changing LLM that was Reflection 70b
Posted by LosEagle@reddit | LocalLLaMA | View on Reddit | 20 comments
mivog49274@reddit
What you think it will be..
Posted by Independent-Wind4462@reddit | LocalLLaMA | View on Reddit | 139 comments
mivog49274@reddit
deepseek-ai/DeepSeek-V3.1-Base 路 Hugging Face
Posted by xLionel775@reddit | LocalLLaMA | View on Reddit | 196 comments
mivog49274@reddit
deepseek-ai/DeepSeek-V3.1-Base 路 Hugging Face
Posted by xLionel775@reddit | LocalLLaMA | View on Reddit | 196 comments
mivog49274@reddit
deepseek-ai/DeepSeek-V3.1-Base 路 Hugging Face
Posted by xLionel775@reddit | LocalLLaMA | View on Reddit | 196 comments
mivog49274@reddit
GPT-OSS Benchmarks: How GPT-OSS-120B Performs in Real Tasks
Posted by facethef@reddit | LocalLLaMA | View on Reddit | 79 comments
mivog49274@reddit
We鈥檙e definitely keeping him up at night right now.
Posted by Porespellar@reddit | LocalLLaMA | View on Reddit | 35 comments
mivog49274@reddit
OpenAI's open-weight model will debut as soon as next week
Posted by phantasm_ai@reddit | LocalLLaMA | View on Reddit | 115 comments
mivog49274@reddit
Google releases MagentaRT for real time music generation
Posted by hackerllama@reddit | LocalLLaMA | View on Reddit | 81 comments
mivog49274@reddit
INTELLECT-2 finished training today
Posted by kmouratidis@reddit | LocalLLaMA | View on Reddit | 21 comments
mivog49274@reddit
Qwen3-30B-A3B solves the o1-preview Cipher problem!
Posted by sunpazed@reddit | LocalLLaMA | View on Reddit | 18 comments
mivog49274@reddit
What's interesting is that Qwen's release is three months behind Deepseek's. So, if you believe Qwen 3 is currently the leader in open source, I don't think that will last, as R2 is on the verge of release. You can see the gap between Qwen 3 and the three-month-old Deepseek R1.
Posted by Select_Dream634@reddit | LocalLLaMA | View on Reddit | 53 comments
mivog49274@reddit
Open-Weights Model next week?
Posted by MustBeSomethingThere@reddit | LocalLLaMA | View on Reddit | 75 comments
mivog49274@reddit
How do LLMs actually do this?
Posted by No-Conference-8133@reddit | LocalLLaMA | View on Reddit | 270 comments
mivog49274@reddit
NoLiMa: Long-Context Evaluation Beyond Literal Matching - Finally a good benchmark that shows just how bad LLM performance is at long context. Massive drop at just 32k context for all models.
Posted by jd_3d@reddit | LocalLLaMA | View on Reddit | 111 comments
mivog49274@reddit
o3-mini is now the SOTA coding model. It is truly something to behold. Procedural clouds in one-shot.
Posted by LocoMod@reddit | LocalLLaMA | View on Reddit | 228 comments