LegacyRemaster

Waiting for Qwen 3.7 open weight... The new King has arrived...

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 282 comments
Testing MiMo-V2.5-IQ3_S with 1'048'576 context

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 17 comments
New models when? Forecasting release date.

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 84 comments
Trying Mimo 2.5 pro with RTX 6000 96gb + W7800 48gb + 128gb DDR4

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 7 comments
Cuda + ROCm simultaneously with -DGGML_BACKEND_DL=ON !

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 27 comments
XiaomiMiMo MiMo-V2.5 (not pro) - Architecture: Sparse MoE (Mixture of Experts), 310B total / 15B activated parameters

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 16 comments
Is it possible to edit LLAMA.CPP with Cline+Vscode+Minimax 2.7 Q4_K_S and get a working build?

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 13 comments
Where we are. In a year, everything has changed. Kimi - Minimax - Qwen - Gemma - GLM

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 30 comments
Minimax 2.7: good news!

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 41 comments
FT - China’s Alibaba shifts towards revenue over open-source AI

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 132 comments
Qwen 3.5 397B vs Qwen 3.6-Plus

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 74 comments
We have a new weight class...

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 123 comments
Minimax 2.7: Today marks 14 days since the post on X and 12 since huggingface on openweight

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 81 comments
I understand the disappointment if minimax 2.7 does not become open weights but we have had a lot..

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 28 comments
Trellis 2 run locally: not easy but possible

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 22 comments
Minimax 2.7: If the model updates itself, who is responsible for the behavior of the version running in production?

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 7 comments
Claude Code with LM studio: 0.4.1

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 13 comments
Testing Fine-tuning Studio

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 1 comments
Is memory speed everything? A quick comparison between the RTX 6000 96GB and the AMD W7800 48GB x2.

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 18 comments
Model: support GLM-OCR merged! LLama.cpp

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 7 comments
The impossible marriage: GLM 5 unsloth TQ1 test on 1xRTX 6000 + 2xW7800 48gb.

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 3 comments
I tested qwen next 80b instruct with Claude Code on llama.cpp. I created the Tetris game and it worked 100% locally.

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 46 comments
Qwen3.5-397B-A17B : a significant step forward in many benchmarks but still too many hallucinations

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 16 comments
Qwen3.5-397B-A17B will be open source!

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 27 comments
From Minimax: Forge: Scalable Agent RL Framework and Algorithm

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 1 comments
Ring-1T-2.5 Trillion Parameter Open Source Thinking Model

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 0 comments
While we wait for Deepseek 4, Unsloth is quietly releasing gguf for 3.2...

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 12 comments
Cerebras MiniMax-M2.1-REAP-139B-A10B - Mradermacher Q4_K_S tested

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 23 comments
Scrolling through the trending list on huggingface I found LightOnOCR-2-1B ....

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 4 comments
Model: cerebras/GLM-4.7-REAP-268B-A32B incoming!

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 50 comments
OpenAI has signed a $10 billion contract with Cerebras

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 37 comments
Tested Glm-4.7-REAP-40p IQ3_S . Single RTX 6000. Works

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 15 comments
XiaomiMiMo.MiMo-V2-Flash: is there a reason why i see so few ggufs?

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 36 comments
KT-Kernel achieves up to >4.5x prefill and 30% faster decode compared to llama.cpp on the same hardware , why?

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 8 comments
Minimax M2.1 is out!

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 47 comments
I was waiting for Minimax and MiMo-V2-Flash arrived!!!

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 15 comments
ik_llama GLM 4.7 : 8~9 tokens/sec (ubergarm) instead of 4.5~5 tokens/sec (llama.cpp)

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 6 comments
Qwen3-Next-80B-A3B-Thinking-GGUF has just been released on HuggingFace

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 52 comments
We basically have GLM 4.6 Air, without vision

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 14 comments
I just wanted to do a first benchmark of GLM 4.6 on my PC and I was surprised...

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 35 comments
MiniMax-M2 Asteroid game - Unsloth

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 9 comments
Minimax M2 thinks it's GPT...

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 4 comments
I'm testing the progress on GitHub. Qwen Next gguf. Fingers crossed.

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 15 comments
8 GPU Arc Pro B60 setup. 192 gb Vram

Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 14 comments