LegacyRemaster
nex-agi/Nex-N2-mini • Huggingface
Posted by External_Mood4719@reddit | LocalLLaMA | View on Reddit | 16 comments
Does anyone have news about the next GLM or Kimi model?
Posted by ihatebeinganonymous@reddit | LocalLLaMA | View on Reddit | 12 comments
LegacyRemaster@reddit
Skip Nvidia New Spark Laptops?
Posted by Hannibalj2ca@reddit | LocalLLaMA | View on Reddit | 45 comments
LegacyRemaster@reddit
Big Model Value Wars - DeepSeek V4 Pro vs MiMo-V2.5-Pro vs MiniMax M3
Posted by valtor2@reddit | LocalLLaMA | View on Reddit | 4 comments
LegacyRemaster@reddit
Qwen 3.7 Plus just briefly appeared and then disappeared on OpenRouter.
Posted by ihatebeinganonymous@reddit | LocalLLaMA | View on Reddit | 28 comments
LegacyRemaster@reddit
Calling it now Microsoft is buying Unsloth.
Posted by Wrong_Mushroom_7350@reddit | LocalLLaMA | View on Reddit | 336 comments
LegacyRemaster@reddit
I burned a weekend making the models "remember" me. The fix had nothing to do with trying to run bigger models locally
Posted by shbong@reddit | LocalLLaMA | View on Reddit | 20 comments
LegacyRemaster@reddit
ui: Add Thinking mode toggle with reasoning effort levels + improvements for Chat Form Add Action UI by allozaur · Pull Request #23434 · ggml-org/llama.cpp
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 9 comments
LegacyRemaster@reddit
next MiniMax will be released in ~10 Days
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 52 comments
LegacyRemaster@reddit
Stepfun 3.7 Flash is very good
Posted by -dysangel-@reddit | LocalLLaMA | View on Reddit | 89 comments
LegacyRemaster@reddit
My home data center
Posted by alecKarfonta@reddit | LocalLLaMA | View on Reddit | 86 comments
LegacyRemaster@reddit
Cost Analysis of my $6.4k Local LLM Server
Posted by 1ncehost@reddit | LocalLLaMA | View on Reddit | 73 comments
LegacyRemaster@reddit
Is he crazy to say that?
Posted by pmv143@reddit | LocalLLaMA | View on Reddit | 203 comments
LegacyRemaster@reddit
Fed up with vibe coders, dev sneaks data-nuking prompt injection into their code
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 135 comments
LegacyRemaster@reddit
Under 3 second time to first token, I literally don’t know what to add or do next for my local LLM. Can I get some input on ways to improve it?
Posted by Fear_ltself@reddit | LocalLLaMA | View on Reddit | 15 comments
LegacyRemaster@reddit
vLLM PR adding native HIP W4A16 kernel was merged
Posted by StupidityCanFly@reddit | LocalLLaMA | View on Reddit | 11 comments
LegacyRemaster@reddit
llama: use f16 mask for FA to save VRAM by am17an · Pull Request #23764 · ggml-org/llama.cpp
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 78 comments
LegacyRemaster@reddit
StepFun 3.7 Flash - Speed Benchmark in M5 Max
Posted by Beamsters@reddit | LocalLLaMA | View on Reddit | 13 comments
LegacyRemaster@reddit
StepFun 3.7 Flash
Posted by Everlier@reddit | LocalLLaMA | View on Reddit | 151 comments
LegacyRemaster@reddit
I've just benchmarked myself:
Posted by JLeonsarmiento@reddit | LocalLLaMA | View on Reddit | 171 comments
LegacyRemaster@reddit
The frontier reasoning race is starting to look like a crowded subway station
Posted by ExoticYesterday8282@reddit | LocalLLaMA | View on Reddit | 63 comments
LegacyRemaster@reddit
MiniMax M3 Is Coming Up
Posted by Few_Painter_5588@reddit | LocalLLaMA | View on Reddit | 7 comments
LegacyRemaster@reddit
SWE-rebench Leaderboard (March, April and May 2026): GPT-5.5, Opus 4.7, Cursor (Composer 2.5), Kimi K2.6 and More
Posted by CuriousPlatypus1881@reddit | LocalLLaMA | View on Reddit | 41 comments
LegacyRemaster@reddit
Looks like Miminax-M3 is just around the corner
Posted by OnkelBB@reddit | LocalLLaMA | View on Reddit | 40 comments
LegacyRemaster@reddit
Stop pretending self-hosting is cheaper. It's not. We do it for different reasons and we should say so.
Posted by Napster3301@reddit | LocalLLaMA | View on Reddit | 88 comments
LegacyRemaster@reddit
server: fix checkpoints creation by jacekpoplawski · Pull Request #22929 · ggml-org/llama.cpp
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 40 comments
LegacyRemaster@reddit
Coding Agent Tier List After Using These Across Real Production Codebases
Posted by Cute_Dragonfruit4738@reddit | LocalLLaMA | View on Reddit | 7 comments
LegacyRemaster@reddit
Gemma is so much better than Qwen, prove me wrong
Posted by Mountain_Patience231@reddit | LocalLLaMA | View on Reddit | 62 comments
LegacyRemaster@reddit
397B competitor that fits in 256 RAM?
Posted by quietsubstrate@reddit | LocalLLaMA | View on Reddit | 53 comments
LegacyRemaster@reddit
DeepSeek is pushing forward with $10.29 billion financing round, with Liang Wenfeng committing to continue developing open-source AI models rather than pursuing short-term commercialization goals
Posted by External_Mood4719@reddit | LocalLLaMA | View on Reddit | 119 comments
LegacyRemaster@reddit
AMD Powers Next-Generation Agent Computers with New Ryzen AI Halo Developer Platform and Ryzen AI Max PRO 400 Series Processors
Posted by Baumpaladin@reddit | LocalLLaMA | View on Reddit | 66 comments
LegacyRemaster@reddit
Waiting for Qwen 3.7 open weight... The new King has arrived...
Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 284 comments
LegacyRemaster@reddit (OP)
We're Thursday and no one claimed AGI yet this week!
Posted by oodelay@reddit | LocalLLaMA | View on Reddit | 70 comments
LegacyRemaster@reddit
Re. what ever happened to Cohere’s Command-A series of models?
Posted by nick_frosst@reddit | LocalLLaMA | View on Reddit | 102 comments
LegacyRemaster@reddit
Guardrails take an 8B model from 53% to 99% on agentic tasks [ACM CAIS '26 preprint]
Posted by billy_booboo@reddit | LocalLLaMA | View on Reddit | 14 comments
LegacyRemaster@reddit