Yu2sama
Why are there so few small local creative writing models from the Chinese?
Posted by kabachuha@reddit | LocalLLaMA | View on Reddit | 64 comments
The 4B class of 2026 (benchmark)
Posted by FederalAnalysis420@reddit | LocalLLaMA | View on Reddit | 59 comments
Yu2sama@reddit
HauhauCS (of "Uncensored Aggressive" fame) published an abliteration package that plagiarizes Heretic without attribution, and violates its license
Posted by nathandreamfast@reddit | LocalLLaMA | View on Reddit | 235 comments
Yu2sama@reddit
Forgive my ignorance but how is a 27B model better than 397B?
Posted by No_Conversation9561@reddit | LocalLLaMA | View on Reddit | 286 comments
Yu2sama@reddit
Kimi K2.6 Released (huggingface)
Posted by BiggestBau5@reddit | LocalLLaMA | View on Reddit | 277 comments
Yu2sama@reddit
Ternary Bonsai: Top intelligence at 1.58 bits
Posted by pmttyji@reddit | LocalLLaMA | View on Reddit | 89 comments
Yu2sama@reddit
These "Claude-4.6-Opus" Fine Tunes of Local Models Are Usually A Downgrade
Posted by BuffMcBigHuge@reddit | LocalLLaMA | View on Reddit | 128 comments
Yu2sama@reddit
These "Claude-4.6-Opus" Fine Tunes of Local Models Are Usually A Downgrade
Posted by BuffMcBigHuge@reddit | LocalLLaMA | View on Reddit | 128 comments
Yu2sama@reddit
These "Claude-4.6-Opus" Fine Tunes of Local Models Are Usually A Downgrade
Posted by BuffMcBigHuge@reddit | LocalLLaMA | View on Reddit | 128 comments
Yu2sama@reddit
FernflowerAI-35B-A3B-KL-ReLU-GGUF + Apple MLX
Posted by EvilEnginer@reddit | LocalLLaMA | View on Reddit | 18 comments
Yu2sama@reddit
It looks like there are no plans for smaller GLM models
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 128 comments
Yu2sama@reddit
It looks like there are no plans for smaller GLM models
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 128 comments
Yu2sama@reddit
It looks like there are no plans for smaller GLM models
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 128 comments
Yu2sama@reddit
Final voting results for Qwen 3.6
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 285 comments
Yu2sama@reddit
Is inverse LoRA distillation between Qwen 2.5 1.5B and 7B a viable idea, or just an interesting dead end?
Posted by Plus_Original_3154@reddit | LocalLLaMA | View on Reddit | 9 comments
Yu2sama@reddit
Hermes agent might be the best open source agent for local models right now
Posted by virtualunc@reddit | LocalLLaMA | View on Reddit | 31 comments
Yu2sama@reddit
Hermes agent might be the best open source agent for local models right now
Posted by virtualunc@reddit | LocalLLaMA | View on Reddit | 31 comments
Yu2sama@reddit
p-e-w/gemma-4-E2B-it-heretic-ara: Gemma 4's defenses shredded by Heretic's new ARA method 90 minutes after the official release
Posted by -p-e-w-@reddit | LocalLLaMA | View on Reddit | 82 comments
Yu2sama@reddit
Can we block fresh accounts from posting?
Posted by king_of_jupyter@reddit | LocalLLaMA | View on Reddit | 121 comments
Yu2sama@reddit
Can we block fresh accounts from posting?
Posted by king_of_jupyter@reddit | LocalLLaMA | View on Reddit | 121 comments
Yu2sama@reddit
Can we block fresh accounts from posting?
Posted by king_of_jupyter@reddit | LocalLLaMA | View on Reddit | 121 comments
Yu2sama@reddit
Bankai (卍解) — the first post-training adaptation method for true 1-bit LLMs.
Posted by Turbulent-Sky5396@reddit | LocalLLaMA | View on Reddit | 117 comments
Yu2sama@reddit
Gemma 4 released
Posted by garg-aayush@reddit | LocalLLaMA | View on Reddit | 80 comments
Yu2sama@reddit
Gemma time! What are your wishes ?
Posted by Specter_Origin@reddit | LocalLLaMA | View on Reddit | 145 comments
Yu2sama@reddit
LocalLLaMA 2026
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 133 comments
Yu2sama@reddit
Friendly reminder inference is WAY faster on Linux vs windows
Posted by triynizzles1@reddit | LocalLLaMA | View on Reddit | 111 comments
Yu2sama@reddit
Do LLMs get "lazy" outside of normal 9-to-5 hours?
Posted by DerBasti85@reddit | LocalLLaMA | View on Reddit | 18 comments
Yu2sama@reddit
The AI releases hype cycle in a nutshell
Posted by GreenBird-ee@reddit | LocalLLaMA | View on Reddit | 42 comments
Yu2sama@reddit
Anyway to get close to GPT4o on a local model (I know it’s a dumb question)
Posted by octopi917@reddit | LocalLLaMA | View on Reddit | 82 comments
Yu2sama@reddit
Qwen3.5-27B-Claude-4.6-Opus-Uncensored-V2-Kullback-Leibler-GGUF
Posted by EvilEnginer@reddit | LocalLLaMA | View on Reddit | 77 comments
Yu2sama@reddit
Qwen3.5-27B-Claude-4.6-Opus-Uncensored-V2-Kullback-Leibler-GGUF
Posted by EvilEnginer@reddit | LocalLLaMA | View on Reddit | 77 comments
Yu2sama@reddit
Assistant_Pepe_70B, beats Claude on silly questions, on occasion
Posted by Sicarius_The_First@reddit | LocalLLaMA | View on Reddit | 77 comments
Yu2sama@reddit
prompting help
Posted by ProfessionalDraw2315@reddit | LocalLLaMA | View on Reddit | 3 comments
Yu2sama@reddit
Local replacement GGUF for Claude Sonnet 4.5
Posted by SmithDoesGaming@reddit | LocalLLaMA | View on Reddit | 13 comments
Yu2sama@reddit
Are we currently in a "Golden Time" for low VRAM/1 GPU users with Qwen 27b?
Posted by inthesearchof@reddit | LocalLLaMA | View on Reddit | 117 comments
Yu2sama@reddit
I feel like if they made a local model focused specifically on RP it would be god tier even if tiny
Posted by Borkato@reddit | LocalLLaMA | View on Reddit | 27 comments
Yu2sama@reddit
I feel like if they made a local model focused specifically on RP it would be god tier even if tiny
Posted by Borkato@reddit | LocalLLaMA | View on Reddit | 27 comments
Yu2sama@reddit
Alibaba confirms they are committed to continuously open-sourcing new Qwen and Wan models
Posted by TKGaming_11@reddit | LocalLLaMA | View on Reddit | 79 comments
Yu2sama@reddit
I've seen a lot of Opus 4.6 distills, why not 5.4 pro?
Posted by FusionCow@reddit | LocalLLaMA | View on Reddit | 21 comments
Yu2sama@reddit
Why 90% of AI chatbots feel like they’re stuck in 2024.
Posted by Legendary_Outrage@reddit | LocalLLaMA | View on Reddit | 21 comments
Yu2sama@reddit
Every LLM has a default voice and it's making us all sound the same
Posted by prokajevo@reddit | LocalLLaMA | View on Reddit | 42 comments
Yu2sama@reddit
Qwen3.5-9B-Claude-4.6-Opus-Uncensored-Distilled-GGUF
Posted by EvilEnginer@reddit | LocalLLaMA | View on Reddit | 213 comments
Yu2sama@reddit
What is after Qwen ?
Posted by j_lyf@reddit | LocalLLaMA | View on Reddit | 17 comments
Yu2sama@reddit
What is after Qwen ?
Posted by j_lyf@reddit | LocalLLaMA | View on Reddit | 17 comments
Yu2sama@reddit
What do you end up doing with personal projects that were heavily assisted by an LLM?
Posted by derekp7@reddit | LocalLLaMA | View on Reddit | 11 comments
Yu2sama@reddit
Fine-tuned Qwen3 SLMs (0.6-8B) beat frontier LLMs on narrow tasks
Posted by Jolly-Gazelle-6060@reddit | LocalLLaMA | View on Reddit | 82 comments
Yu2sama@reddit
Qwen dev on Twitter!!
Posted by Difficult-Cap-7527@reddit | LocalLLaMA | View on Reddit | 61 comments
Yu2sama@reddit
The walled garden gets higher walls: Anthropic is adding weekly rate limits for paid Claude subscribers
Posted by Resident_Egg5765@reddit | LocalLLaMA | View on Reddit | 48 comments
Yu2sama@reddit
Any Rpers test the new qwen 2507 yet?
Posted by Antique_Bit_1049@reddit | LocalLLaMA | View on Reddit | 3 comments
Yu2sama@reddit
Qwen’s TRIPLE release this week + Vid Gen model coming
Posted by koc_Z3@reddit | LocalLLaMA | View on Reddit | 35 comments