Cerebras REAPs: MiniMax-M2 (25, 30, 40%), Kimi-Linear 30%, more on the way!

Posted by ilzrvch@reddit | LocalLLaMA | View on Reddit | 22 comments

Hey everyone, we just dropped REAP'd MiniMax-M2 in 3 sizes:

https://hf.co/cerebras/MiniMax-M2-REAP-172B-A10B

https://hf.co/cerebras/MiniMax-M2-REAP-162B-A10B

https://hf.co/cerebras/MiniMax-M2-REAP-139B-A10B

We're running more agentic benchmarks for MiniMax-M2 REAPs, so far we're seeing good accuracy retention, especially at 25 and 30% compression.

We also recently released a Kimi-Linear REAP@30% and it works well for coding and for long-context QA:

https://hf.co/cerebras/Kimi-Linear-REAP-35B-A3B-Instruct

We're also working to get a Kimi-K2-Think REAP out, so stay tuned. Enjoy!