Marcuss2
KVarN: new KV-cache quant from Huawei. 3–5× KV cache compression with actual speed-up instead of slow-down, and unlike TurboQuant it holds up on reasoning (Apache 2.0, vLLM single flag)
Posted by acluk90@reddit | LocalLLaMA | View on Reddit | 94 comments
New DeepSWE benchmark finds Claude Opus cheats
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 92 comments
Marcuss2@reddit
ZAYA1-8B: Frontier intelligence density, trained on AMD
Posted by carbocation@reddit | LocalLLaMA | View on Reddit | 108 comments
Marcuss2@reddit
Kimi K2.6 vs DeepSeek V4 Pro
Posted by bigboyparpa@reddit | LocalLLaMA | View on Reddit | 38 comments
Marcuss2@reddit
ibm-granite/granite-4.1-8b · Hugging Face
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 35 comments
Marcuss2@reddit
ibm-granite/granite-4.1-8b · Hugging Face
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 35 comments
Marcuss2@reddit
ibm-granite/granite-4.1-8b · Hugging Face
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 35 comments
Marcuss2@reddit
Kimi K2.6 Released (huggingface)
Posted by BiggestBau5@reddit | LocalLLaMA | View on Reddit | 277 comments
Marcuss2@reddit
Llama4 108b $800 setup
Posted by kylerrr02@reddit | LocalLLaMA | View on Reddit | 13 comments
Marcuss2@reddit
Experiment: Olmo 3 7B Instruct Q1_0
Posted by butlan@reddit | LocalLLaMA | View on Reddit | 47 comments
Marcuss2@reddit
It looks like there are no plans for smaller GLM models
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 128 comments
Marcuss2@reddit
It looks like there are no plans for smaller GLM models
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 128 comments
Marcuss2@reddit
Announcement: Temporary LLM Content Ban
Posted by ChemicalRascal@reddit | programming | View on Reddit | 326 comments
Marcuss2@reddit
PrismML — Announcing 1-bit Bonsai: The First Commercially Viable 1-bit LLMs
Posted by brown2green@reddit | LocalLLaMA | View on Reddit | 182 comments
Marcuss2@reddit
Kimi K2.6 will drop in the next 2 weeks, K3 is WIP and will be huge
Posted by No-Thought-4995@reddit | LocalLLaMA | View on Reddit | 68 comments
Marcuss2@reddit
Introducing ARC-AGI-3
Posted by Complete-Sea6655@reddit | LocalLLaMA | View on Reddit | 100 comments
Marcuss2@reddit
OpenCode source code audit: 7 external domains contacted, no privacy policy, 12 community PRs unmerged for 3+ months
Posted by Spotty_Weldah@reddit | LocalLLaMA | View on Reddit | 48 comments
Marcuss2@reddit
OpenCode source code audit: 7 external domains contacted, no privacy policy, 12 community PRs unmerged for 3+ months
Posted by Spotty_Weldah@reddit | LocalLLaMA | View on Reddit | 48 comments
Marcuss2@reddit
Total beginner here—Why is LM Studio making me do the "heavy lifting" manually?
Posted by Ofer1984@reddit | LocalLLaMA | View on Reddit | 121 comments
Marcuss2@reddit
Application code has dozens of static analyzers, SQL has almost nothing, here's what exists.
Posted by Anonymedemerde@reddit | programming | View on Reddit | 29 comments
Marcuss2@reddit
Qwen3.5 27B vs Devstral Small 2 - Next.js & Solidity (Hardhat)
Posted by Holiday_Purpose_3166@reddit | LocalLLaMA | View on Reddit | 43 comments
Marcuss2@reddit
Qwen3.5 27B vs Devstral Small 2 - Next.js & Solidity (Hardhat)
Posted by Holiday_Purpose_3166@reddit | LocalLLaMA | View on Reddit | 43 comments
Marcuss2@reddit
24gb M4 Mac Mini vs 9070XT + 32gb system RAM. What to expect?
Posted by Soft-Distance-6571@reddit | LocalLLaMA | View on Reddit | 17 comments
Marcuss2@reddit
Why Senior Engineers Let Bad Projects Fail
Posted by Ordinary_Leader_2971@reddit | programming | View on Reddit | 121 comments
Marcuss2@reddit
D7VK 1.1 adds experimental Direct3D 6 support for classic PC games on Linux
Posted by RenatsMC@reddit | linux | View on Reddit | 18 comments
Marcuss2@reddit
D7VK 1.1 adds experimental Direct3D 6 support for classic PC games on Linux
Posted by RenatsMC@reddit | linux | View on Reddit | 18 comments
Marcuss2@reddit
NVIDIA Nemotron 3 Nano 30B A3B released
Posted by rerri@reddit | LocalLLaMA | View on Reddit | 96 comments
Marcuss2@reddit
Aquif 3.5 Max 1205 (42B-A3B)
Posted by Holiday_Purpose_3166@reddit | LocalLLaMA | View on Reddit | 56 comments
Marcuss2@reddit
Micron Announces Exit from Crucial Consumer Business
Posted by FullstackSensei@reddit | LocalLLaMA | View on Reddit | 190 comments
Marcuss2@reddit
Qwen3 Next almost ready in llama.cpp
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 36 comments
Marcuss2@reddit
AMD Ryzen AI Max 395+ 256/512 GB Ram?
Posted by quantier@reddit | LocalLLaMA | View on Reddit | 91 comments
Marcuss2@reddit
AMD Ryzen AI Max 395+ 256/512 GB Ram?
Posted by quantier@reddit | LocalLLaMA | View on Reddit | 91 comments
Marcuss2@reddit
New Qwen models are unbearable
Posted by kevin_1994@reddit | LocalLLaMA | View on Reddit | 293 comments
Marcuss2@reddit
MiniMax LLM head confirms: new model M2.1 coming soon
Posted by External_Mood4719@reddit | LocalLLaMA | View on Reddit | 8 comments
Marcuss2@reddit
Want to run claude like model on ~$10k budget. Please help me with the machine build. I don't want to spend on cloud.
Posted by LordSteinggard@reddit | LocalLLaMA | View on Reddit | 131 comments
Marcuss2@reddit
Kimi Linear released
Posted by Badger-Purple@reddit | LocalLLaMA | View on Reddit | 65 comments
Marcuss2@reddit
Kimi Linear released
Posted by Badger-Purple@reddit | LocalLLaMA | View on Reddit | 65 comments
Marcuss2@reddit
Kimi Linear released
Posted by Badger-Purple@reddit | LocalLLaMA | View on Reddit | 65 comments
Marcuss2@reddit
Kimi Linear released
Posted by Badger-Purple@reddit | LocalLLaMA | View on Reddit | 65 comments
Marcuss2@reddit
Kimi Linear released
Posted by Badger-Purple@reddit | LocalLLaMA | View on Reddit | 65 comments
Marcuss2@reddit
The average codebase is now 50% dependencies — is this sustainable?
Posted by Legitimate_Sun1783@reddit | programming | View on Reddit | 288 comments
Marcuss2@reddit
Sparse Adaptive Attention “MoE”: How I Solved OpenAI’s $650B Problem With a £700 GPU
Posted by EconomicConstipator@reddit | LocalLLaMA | View on Reddit | 107 comments
Marcuss2@reddit
MoonshotAI/kimi-cli - CLI coding agent from MoonshotAI
Posted by nullmove@reddit | LocalLLaMA | View on Reddit | 7 comments
Marcuss2@reddit
Qwen3 Next support in llama.cpp ready for review
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 51 comments
Marcuss2@reddit
Granite 4.0 Language Models - a ibm-granite Collection
Posted by rerri@reddit | LocalLLaMA | View on Reddit | 264 comments
Marcuss2@reddit
Elmo is providing
Posted by vladlearns@reddit | LocalLLaMA | View on Reddit | 163 comments
Marcuss2@reddit
Elmo is providing
Posted by vladlearns@reddit | LocalLLaMA | View on Reddit | 163 comments
Marcuss2@reddit
Elmo is providing
Posted by vladlearns@reddit | LocalLLaMA | View on Reddit | 163 comments
Marcuss2@reddit
Falcon-H1 technical report release
Posted by JingweiZUO@reddit | LocalLLaMA | View on Reddit | 14 comments
Marcuss2@reddit
Running an LLM on a PS Vita
Posted by ajunior7@reddit | LocalLLaMA | View on Reddit | 13 comments