next MiniMax will be released in ~10 Days
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 48 comments
but will be probably too big for my setup
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 48 comments
but will be probably too big for my setup
Aroochacha@reddit
Well, I was put off n involuntarily vacation. No issues with licensing; can’t wait.
Ok_Warning2146@reddit
Good. At least MiniMax gives us a timeline unlike Qwen.
MDSExpro@reddit
As with MiniMax-2.7 vs MiniMax-2.5/2.1 - it's popularity will be dictated by license.
EveningIncrease7579@reddit
I have the impression that they know that at the end of the day, leaving it open is the best way to help make them known, than to leave it closed and ostracize it.
Howard_banister@reddit
Isn't that even bad? They (Minimax, Kimi, etc) spend millions developing these models, just for a company like Cursor to repackage Kimi 2.5 as "Composer" without attribution and take all the profit.
Open-source licensing needs to change to restrict these kinds of bad actors. It's supposed to help individuals and the community, not subsidize lazy companies looking for a free ride.
SalariedSlave@reddit
2.7 was basically doa, that license made sure not many people ever used it. let's see if they repeat that mistake.
ilintar@reddit
I like it that they're keeping the weights release deterministic, unlike what Qwen has adopted with their weird new marketing strategy of "tease to release".
jacek2023@reddit (OP)
Yes, I probably won't be able to run this model, but I wanted to point out that MiniMax is still open, while Qwen’s current status is unknown.
relmny@reddit
"unknown"? they released 3.6 about a month ago!
Why people expect that they release a model every week, or they start complaining that "they are no longer releasing OW models"... it happened after qwen2.5, qwen3, qwen3.5, now qwen3.6...
ilintar@reddit
Because they released 3.7 Max via API some time ago and this time there hasn't so far been any indication they plan on releasing any 3.7 model weights; also because for 3.5 they already cut down the 300+B model and for 3.6 there was no 120B, 9B, 4B or 1B.
ilintar@reddit
And of course they have a right to their strategy and I'm grateful for all models they released, but noticing that a company is moving away from an open-weight model when they are in fact moving away from it is not "whining", just noticing things.
_Asphadel@reddit
I slightly disagree with you. First of all, Qwen 3.6 Plus was released on March 30th, and the open-source model came out 20 days after that. Now that 3.7 Max is out, waiting about a month for the open-source version to drop is totally normal. Regarding Smaller vs. Larger Models First, imagine a 9B model came out at the beginning of March. Not even a month has passed between releases, and you're suggesting they put out a model that’s only 10% better than the previous one? What’s the point? Just so people like you can write, "Why did you even release this model? It's basically just like 3.5." There is zero sense in bothering to release a model that will be irrelevant from day one and only marginally better. In reality, they’d release a model only to have it get completely trashed, and on top of that, nobody would even use it. Why would they want that kind of bad PR? The Strategy for Big Models Next, regarding the large models, my assumption is pretty much the same. They revolutionized the game with the 27B and 35B models. Either they haven't gathered enough data yet to release a local 120B, 350B, or something similar, or they are holding back. What will you say if they release, say, Qwen 4 220B, which goes head-to-head with GPT-5.5 and Opus 4.6? You'll be kissing their feet! But if they release a Qwen 3.7 220B model right now that's only 10–20% better than their Qwen 3 model? You'll tear them apart and say you're never using their stuff again. I hope I've made my point clear.
sammcj@reddit
400b+ so not that useful for most people.
lantern_lol@reddit
Got a source? Or just assuming from results?
snapo84@reddit
from my testing i would "estimate" approx. 460B parameter and 46B active...
lantern_lol@reddit
That's a very specific "estimate" lmao
rowan_h@reddit
Yep too much for me
xeeff@reddit
available on the API? how is it? I bought their 10$ sub and really regretted it, using opencode go 5$/month for 60$ of inference rn
jacek2023@reddit (OP)
On this sub we pay for GPUs not for APIs.
xeeff@reddit
okay but i can't afford £5k of GPUs just to run AI
Clank75@reddit
I sympathise a lot. But that just means this isn't the sub for your question.
xeeff@reddit
sorry? I literally self host I just don't have enough for something as big as m3..? why u gatekeeping
Clank75@reddit
If you're not hosting the model locally it's not local.
It's not gatekeeping, it's in the name of the sub. It's in the rules of the sub ("No Off-topic posts: Content should be related to local AI"). The rules of the sub list many models you CAN run locally.
xeeff@reddit
sorry mr local AI police, please forgive me for using a cloud subscription for things which my local qwen3.6 27b is a little too slow/dumb for, how could I be so arrogant! won't happen again bossman
xeeff@reddit
@Clank75 wackass deleting his comments after blocking me 💀
Clank75@reddit
Sigh. When will I ever learn to just check for the hexagonal avatar and immediately block before engaging...
LegacyRemaster@reddit
I think this version will require a lot of GPUs. I don't think it's 200b.
rowan_h@reddit
Agreed
rawdikrik@reddit
If you regretted it, you arent using it correctly
xeeff@reddit
in that case I'll happily use ds-v4-pro/flash incorrectly instead
kmouratidis@reddit
Why? Quality? Speed? Usage caps?
paperbenni@reddit
M2.7 just isn't as good as DS4 flash, but it is more expensive. M3 doubles the cost of M2.7, I don't know how that doesn't bother anyone. Did M2.7 also launch at a higher price than what it is now? From what I can see this looks like a bigger model
chawza@reddit
Same here. M2.7 likes to eager to play aroudnd with pointless bash.
DS V4 flash is good enough with faster speed (inference and total output)
_cpatonn@reddit
I'm tired boss.
NewtMurky@reddit
It's 456B parameters, with 45.9 B activated.
I was hoping they will keep the same size, since it was a perfect match with 192GB VRAM setup. Now, it's too huge for local setup.
Practical-Collar3063@reddit
oh really ? have they publically announced the size of the model already ?
NewtMurky@reddit
Google says that, but now I think that it is incorrect - those numbers match M1 model.
LegacyRemaster@reddit
4M tokens ---> wrong
soyalemujica@reddit
59% agentic in swe bench? So same as Qwen 3.6 27B
svantana@reddit
What do you base that on? Qwen's own site puts it at 53.5.
soyalemujica@reddit
Swebench own site https://swe-rebench.com/?insight=feb_2026
svantana@reddit
It's confusing but I believe "SWE-Bench", "SWE-Bench Pro", and "SWE-rebench" are three totally different benchmarks from different people.
LegacyRemaster@reddit
We don't know the size yet, but the training data is larger. 27 vs. 200/300/600b makes a big difference.
soyalemujica@reddit
For world knowledge most definitely, however, for programming same use cases
LegacyRemaster@reddit
These days I'm using claude superskill with vscode+claudecode. Qwen 3.6 27b daily driver. Then I go down to 35b moe for fast things and go up to Mimo 2.5 for complex things.
Kahvana@reddit
Link to the post?
jacek2023@reddit (OP)
updated
LegacyRemaster@reddit
The main difference is the vision and context finally from 1M as mimo.