TheaterFire

Mistral's been quiet lately...

Posted by umarmnaq@reddit | LocalLLaMA | View on Reddit | 127 comments

Mistral's been quiet lately...

Reply to Post

127 Comments

Mr_Moonsilver@reddit

Here we go... Small 3 dropped
View on Reddit #47238849

Dark_Fire_12@reddit

Soon https://preview.redd.it/q54016w8a59e1.png?width=598&format=png&auto=webp&s=8b81656192203632a0b096b1538799be1d8f192a
View on Reddit #43991661

procgen@reddit

"In the coming weeks."
View on Reddit #44032468

SocialBudai@reddit

Mistral seems to be the one so far. They made me happy. It's like blizzard with Diablo 1.
View on Reddit #44799130

silenceimpaired@reddit

I would almost be interested if Qwen didn’t have better performance and licensing across the board for my use.
View on Reddit #44012474

nrkishere@reddit

Unless backed by hardware/infrastructure providers, it is very hard for a AI company to survive. Mistral have almost a billion in funding, but none of the investors are the likes of microsoft, google or amazon. Lately alibaba and google have significantly raised the bar. Facebook is also in the race, IBM has joined recently. All of them have open source or semi-open source models. So I think no one really cares about mistral at this moment, unless they pull off something comparable to o1+
View on Reddit #43990494

zitr0y@reddit

> IBM has joined recently And their 2b model us surprisingly good. I was trying out a dozen models for a sentiment analysis task and theirs came a close second for that task after qwen2.5:3b (better than qwen2.5 7b, llama 3.1 8b and many more surprisingly)
View on Reddit #43996784

Bitter-Good-2540@reddit

Which 2b model?
View on Reddit #44031129

zitr0y@reddit

It is called granite3.1-dense
View on Reddit #44031197

Bitter-Good-2540@reddit

Thanks! You tried to use it for local CPU rag?
View on Reddit #44048153

zitr0y@reddit

No, I gave it a number (>200k) of German sentences with rapper names in them and made it categorize how positively or negatively the sentiment in the sentences is in regards to the rapper (only giving out a number between 1 and 5). I ran on GPU via ollama and its python integration. Feel free to ask more questions about it, I'm currently writing the research paper :D
View on Reddit #44048390

Willing_Landscape_61@reddit

Did you compare with Bert models? Is seems to me that LLMs aren't the right tool for the job of text classification. (It's not like you are actually generating text).
View on Reddit #44085232

zitr0y@reddit

You make a good point. In my class, it wasn't really made that clear what Bert actually does, I thought it was just an earlier, worse version of LLMs still used as a baseline in research. But it would likely have been a more fitting tool for the task. That said, qwen 2.5 3b did decently overall, with 65% perfect agreement and 95% off-by-one classification, zero shot.
View on Reddit #44309628

FPham@reddit

Let's face it, once google realised they had the know how all the time, it went pretty well with Gemini...
View on Reddit #44143056

Massive_Robot_Cactus@reddit

"Facebook is also in the race" Bruh.
View on Reddit #43991398

nrkishere@reddit

yes they are. Nowhere said they are making state of the art models, but based on any leaderboard, LLAMA models are ahead of mistral's and this is the point of the argument.
View on Reddit #43991777

Bitter-Good-2540@reddit

Llama is often used for fine tunes 
View on Reddit #44033307

FlerD-n-D@reddit

It's the other way around. He's saying you're understating what Facebook is doing.
View on Reddit #43998118

Massive_Robot_Cactus@reddit

Yup, I could have been clearer. Just because Meta doesn't have a large cloud business doesn't mean they don't have one of the 5 largest data center footprints (and GPU compute) in the world.
View on Reddit #44021658

thereisonlythedance@reddit

Mistral have provided the best all round local model in actual use (Mistral Large) and nobody cares about them? No. If nobody cared this thread wouldn’t exist.
View on Reddit #44004300

silenceimpaired@reddit

Their licensing is a big speed bump for me and performance isn’t big enough to switch from Qwen and llama 3.3
View on Reddit #44012572

fka_nate@reddit

But Mistral is literally funded by amazon? and Nvidia? might wanna do some more research on them lmao They’ve also released some really great models (esp for the gpu poor).
View on Reddit #43991189

LevianMcBirdo@reddit

You know how much stuff these companies fund and how little goes to Mistral in the ai sector?
View on Reddit #43991718

nrkishere@reddit

**Strategic partnership** is nothing like funding. Every cloud providers are partners of every open source models, because it brings them revenue. The primary investors, according to crunchbase are Andreessen-horowitz, salesforce and bnp paribas
View on Reddit #43991645

Only-Letterhead-3411@reddit

I hope EU AI Act won't be the end of Mistral. I feel like Mistral really lost traction after that BS.
View on Reddit #43990341

medialoungeguy@reddit

Their commitment to overregulating will be their last move.
View on Reddit #43990655

lleti@reddit

It has been deeply deeply painful to watch us regulate ourselves into irrelevancy.
View on Reddit #44000181

Feisty-Pay-5361@reddit

Well, on the flip side - EU (parts of it anyway) will also be the only place where realistically some form of UBI or monetary support for unemployment will happen. If mass job loss starts in the future due to all the unregulated AI rapidly advancing, citizens of US or Asia are absolutely screwed compared to Europeans (at least Nordic countries will for sure do ok, some others might join in).
View on Reddit #44017101

CNWDI_Sigma_1@reddit

There is no money even for pensions, we are in a pension crisis. UBI would require at least 10x to 20x as much.
View on Reddit #44113695

lleti@reddit

> the only place where realistically some form of UBI or monetary support for unemployment will happen. With what money tho We've sorta regulated outselves out of every major/cutting-edge industry, and a lot of our talent have left shore for the US or the Middle East to enjoy 4x the salary and 0.1x the taxes. Coupled with that, the Euro has been in steep decline against the USD since the financial crisis, with no sign of relief. Unfortunately I don't think there's gunna be anyone to actually pay for UBI on our shores.
View on Reddit #44023447

Disastrous-Peak7040@reddit

JD & Elon are fans of UBI and minimum wage funded by innovation. They say "put kiosks in McDonalds, make more profit, pay better wages, build more restaurants". They're official on raising Fed min wage. The old school conservatives hate it. We may be entering a new pro-tech, pro-worker era?
View on Reddit #44050655

lleti@reddit

Could be in the US The EU will likely regulate & fully outlaw anything which automates away a single job, even if it could fund thousands of UBI recipients in return. We've sorta wrecked ourselves in that regard tbh, we have bureaucrats who outlaw technology without even understanding the bare basics about it.
View on Reddit #44051271

Feisty-Pay-5361@reddit

Well I am not saying the chances are good I am just saying proportionally it will likely be the only place where governments \*might\* give a fuck to come up with some system.
View on Reddit #44029714

lleti@reddit

Yeah, I'm sure they'll try to do something or other - but it's just as likely to accelerate the collapse. Can't just print euros to feed the masses when there's nothing of value backing them.
View on Reddit #44041826

procgen@reddit

Eh, a lot of Americans have 401ks and they'll be absolutely raking it in.
View on Reddit #44032295

AssistBorn4589@reddit

> Well, on the flip side - EU (parts of it anyway) will also be the only place where realistically some form of UBI or monetary support for unemployment will happen. That is not flip side, that is getting fucked while getting fucked.
View on Reddit #44043891

ohio_rizz_rani@reddit

Why thought? Isn't it better for them especially since the act itself talks about giving insights into data used and their models are open source. I think this is an advantage for then in the EU region it's also home grown company so I don't see why The EU AI act is a speed breaker.
View on Reddit #43991028

Only-Letterhead-3411@reddit

The thing is EU AI Act is an hindrance put on Mistral's back in the AI race. While companies like OAI and Anthropic train their models on everything they can get their hands on, Mistral is forced to only use data they own themselves. These closed-source models are very good because they are trained on a lot of copyrighted data. I mean, previous year ChatGPT was giving people working windows license keys when asked. I think OpenAI is the proof that even the professional customers don't care about transparency and explainability, they care about quality and performance
View on Reddit #43992774

goingsplit@reddit

It's plain obvious even in whisper to someone non-knowledgeable like me.. At the end of each inference the model spits a note about subtitles... Meaning it has been trained with copyrighted movies and subtitles produced by people
View on Reddit #44100743

woutertjez@reddit

As someone that works in a large EU headquartered MNC, I can confirm accountability and transparency trumps model power. GDPR and other data/digital related acts in the EU are no joke when it comes to fines. We’re talking multiple percentage points of **global** turnover.
View on Reddit #44002231

InertialLaunchSystem@reddit

What's wild to me is the EU thinking they could enforce this without risking a global trade war. Imagine trying to fine Apple or Nvidia at 4% of global *turnover* (not even profit) - that is a very fast way to end up on the losing side of a terrible trade war.
View on Reddit #44045032

ohio_rizz_rani@reddit

What ever you say it's 100% valid, I agree we live in a capitalistic world. What I am also saying is that there are certain industries like finance, healthcare, pharma where transparency and explainability plays a huge role. Mistral still has a very good chance.
View on Reddit #43993104

InertialLaunchSystem@reddit

If it prevents us from reaching AGI, copyright will have outlived its purpose and will cause more harm than good.
View on Reddit #44044954

MorallyDeplorable@reddit

Copyright irself is unethical.
View on Reddit #44004694

Many_SuchCases@reddit

That part in the EU AI act also means not breaking copyright, which is a question most companies aren't ready to answer. And the need to give insight is only a small part of the act.
View on Reddit #43991477

GraceToSentience@reddit

Baseless Bs: "The AI Act **introduces limited exceptions for text and data mining**, recognizing the importance of balancing copyright protection with promoting innovation and research." [https://keanet.eu/eu-ai-act-shaping-copyright-compliance-in-the-age-of-ai-innovation/#:\~:text=The%20AI%20Act%20introduces%20limited,with%20promoting%20innovation%20and%20research](https://keanet.eu/eu-ai-act-shaping-copyright-compliance-in-the-age-of-ai-innovation/#:~:text=The%20AI%20Act%20introduces%20limited,with%20promoting%20innovation%20and%20research)
View on Reddit #44052310

ohio_rizz_rani@reddit

I don't think it's necessarily bad , because companies like mistral will always have customers in heavily regulated industries where transparent and explainability plays a huge role.
View on Reddit #43991768

SpargeOase@reddit

The customers are paying for the best models. You can't make the best models if you don't have the best quality data. 'Training data' transparency doesn't bring any benefits for most of the end users. We, Europeans, are just coping with this heavy regulation bullshit.
View on Reddit #43993252

Nyghtbynger@reddit

What's sad is that Eurocrats **do believe** law makes money. In Luxembourg they do money with copyrights and when leveraging patent. That's an horrible way of doing money that will be made irrelevant in the few next years like the European Union It seems
View on Reddit #43998551

brotie@reddit

https://i.redd.it/6hvsku7d579e1.gif
View on Reddit #44007316

LevianMcBirdo@reddit

Exactly, having AI that complies with the rules gives you a giant market pretty much for yourself. Also the EU AI act is still not enforced yet (most stuff has a two year period, so 2026) and still Mistral is quiet now for months.
View on Reddit #43991679

Any_Elderberry_3985@reddit

Maybe in Europe but the rest of the world including the US does not care about "the rules" as their is currently no legal risk outside of US IMO, Europe acted too quickly and likely gutted any development from Europe. Don't worry though non European companies will gladly gobble the data and train on it.
View on Reddit #44010502

ThenExtension9196@reddit

Engineers came to America to get paid top dollar. Eu is no place to develop tech.
View on Reddit #43991490

Nyghtbynger@reddit

Do you know how much an average data engineer/scientist is paid after taxes in France? 3500€ or 4000USD per month lol. Or make an effort. You can be top whatever, the state need to pay all the pensions from the old foggies and various welfare programs. That's the only country on earth where retirees earn more than working people. And no one is shocked when you tell them. Culture is good, food is good, cities are top class but doing business and working in France is one of the shittiest thing imaginable in the country, making the upsides *unaffordable*. People sometimes says that the US is a third-world country when it comes to catering to the people and the infrastructure. France is third-world when it comes to not being disappointed when starting an innovative project. No wonder the greatest minds are f*cking fleeing the country ~~I will too~~. Entrepreneur is a french word lmao
View on Reddit #43998307

Original_Bend@reddit

3500€ a month after taxes is in the top tier for a data engineer, maybe in Paris.
View on Reddit #44035002

4sater@reddit

Seriously? Wtf.
View on Reddit #44043399

Nyghtbynger@reddit

And you need 4500€ after taxes to live comfortably in Paris...
View on Reddit #44066416

4sater@reddit

Why the IT salaries are so low in France? I mean you can make 3500 euro after tax as a software/ML engineer even in some developing countries like China or Russia with much lower cost of living... is there a huge brain drain from France to the US and neighbouring countries like UK or Netherlands which afaik have higher salaries for skilled engineers?
View on Reddit #44079958

Nyghtbynger@reddit

Being a software engineer in France is disregarded. Anyone touching a computer is basically an untouchable (indian Dalit). I've seen people that are geniuses in their field, not getting any job because they didn't do the right study or didn't stay long enough in their previous companies, or just don't know the right tool. People with lesser skills but some 6 months training in the tool via some training organization whose boss knows the hiring company have more chance of being hired. That's like a cartel where the manager are some nerdy assholes with a comp sci degree and they only hire asshole. If I seem salty it's because I had to face this kind of people that are average everywhere, except for their big egos. Now looking for options in "third world countries". In Thailand or Malaysia, I could earn 1:1 salary in euro compared to what you earn in smaller towns. That's approx 2.3 times better
View on Reddit #44098815

4sater@reddit

Wow, that's shitty and, tbh, really stupid considering that huge chunks of economy are becoming more digital and are running on software engineers. Not to mention that France is just shooting itself in the foot in AI race... As a French citizen, could you try to go to Netherlands or perhaps Germany? They have better salaries for software developers & AFAIK you don't need a work visa since you are a EU member? Good luck!
View on Reddit #44099199

Nyghtbynger@reddit

Thanks for your words. I have french and thainayionalities. 🤭 That's even better ! I tried in Germany (I speak German) but their GDP dropped 10% since they don't have cheap energy anymore. They don't really hire foreigners right now... I still believe the current situation in France is a waste. The country basically have free nuclear energy and produces a lot of scientists (in data too). Having good AI should be an evidence. But I must be realistic, France is in cultural decline for one century now. They can't imagine themselves without being a global power that relies on a long gone colonial hinterland. Time for the big questioning and some practices changes.. If that doesn't work I can still changes carrier and become a plumber. Fine by me 🤷‍♀️
View on Reddit #44100383

Josh_j555@reddit

> Culture is good, food is good, cities are top class This is quickly changing as well, sadly not for the better.
View on Reddit #44016765

MorallyDeplorable@reddit

> 3500€ or 4000USD per month Is that supposed to be impressive?
View on Reddit #44004756

Bitter-Good-2540@reddit

It's a place to develop. Get you degree for cheap. Got government funding and move to USA. Blackforest did that lol
View on Reddit #44030965

GraceToSentience@reddit

The EU AI Act is a self reported thing much like AI regulations in the USA People don't know what it does and think it's some kind of tough regulation. It's not.
View on Reddit #44052482

mahithefish@reddit

Yeah as much as I’m behind that Act, it’s a very very tough constraint for them to remain competitive but maybe they can win in EU. 
View on Reddit #44039693

anonynousasdfg@reddit

In the worst case with enough investor support they may move their headquarters to U.S, although I'm not sure if it will help them in the long run to become an independent company without being acquired by some closed-source property giants or will just make them bankrupt.
View on Reddit #43995362

lilythompsilly@reddit

Pixtral 12B released recently is quite awesome too. It is exceptional at doing OCR and interpretation related tasks.
View on Reddit #44096428

Willing_Landscape_61@reddit

They probably are frantically reading the DeepSeek 3 paper right now!
View on Reddit #44085281

BetEvening@reddit

daily reminder: https://preview.redd.it/1wwc7gqpkc9e1.png?width=968&format=png&auto=webp&s=c5c80740aac85c931c2afd856dc62400a78b422a :
View on Reddit #44068009

PrinceOfLeon@reddit

Mistral is currently in the process of opening a Bay Area office. I wonder if they'll incorporate separately there in order to get around the EU's restrictions on AI. Personally I lost interest in following them after they stopped releasing under open licenses.
View on Reddit #43993042

Nyghtbynger@reddit

I would have opened a Shenzen or Texas office instead
View on Reddit #44066524

MoffKalast@reddit

EU: It's treason then.
View on Reddit #43997853

Ok_Wear7716@reddit

It’s time for annual 8 week holiday in France, so it makes sense
View on Reddit #44059643

Zangwuz@reddit

Not really, Pixtral Large was released just one month ago.
View on Reddit #43990993

ninjasaid13@reddit

yeah but they've never been quiet for a whole month besides this january and last year october and november.
View on Reddit #44052194

FortranUA@reddit

And need to say it's quite good, in some moments even better then gpt4o in describing images
View on Reddit #43993812

infernys20@reddit

*than
View on Reddit #43994710

FortranUA@reddit

Thanx 😊 I was very sleepy when I wrote
View on Reddit #43998866

JzTheLazy@reddit

Gn mate 😴😴
View on Reddit #44001257

umarmnaq@reddit (OP)

That's a millennium in AI time
View on Reddit #43991192

TheDreamWoken@reddit

Ok
View on Reddit #43999555

MoffKalast@reddit

It has been 84 years...
View on Reddit #43997814

Such_Advantage_6949@reddit

I dont even know what you complain about. Why not asking meta and google to release more who also have more resources? Mistral released pixtral large just recently. Whereas meta and google both doesnt release too end model. The only company that released more is alibaba with their qwen series.
View on Reddit #43993178

Healthy-Nebula-3603@reddit

Google yes . But meta announced llama 4 and soon will release also 2 weeks ago released the llama 3.3 model.
View on Reddit #43994839

Such_Advantage_6949@reddit

Dont think llama 4 released date is confirmed yet right. For 3.3 is more incremental update, whereas their 3.2 vision part is not as good as competitor. In comparison, qwen released good vision model and reasoning model. Pixtral have good vision capabilities. To be honest, i am sure they are capable of release something better. But it feels like the bigger player is intentionally holding back
View on Reddit #43994986

kif88@reddit

Last I heard was Mark Zuckerbergs video on Facebook. He said llama 4 should be out "in 2025"
View on Reddit #44023355

Such_Advantage_6949@reddit

Out in 2025 is not exactly a release date lol
View on Reddit #44048713

Healthy-Nebula-3603@reddit

If you read recent papers from meta and if they implemented that in llama 4 ... then will be wild 😅
View on Reddit #43995258

Dark_Fire_12@reddit

It's a bit, they do ask for them as well. The rotation is Mistral > Meta (Llama) > Google (Gemma) > Cohere. We got 3.3 from Meta and a new updated Paligemma from Google, as well as a 7B from Cohere. Mistral is next up.
View on Reddit #43994813

Such_Advantage_6949@reddit

Haha i nvr realise there is a circle of release lol. Lets see
View on Reddit #43995029

Mother-Ad-2559@reddit

Let them cook
View on Reddit #44043437

FantasticRewards@reddit

Mistral Large is still my favorite model but would love a new Miqu (70B).
View on Reddit #44042934

martinerous@reddit

A Mistral-not-so-small-and-not-that-large would be nice. 32B is the sweet spot for me. I really like the current Mistral Small model for its overall consistency when prompted to follow long step-by-step interactive scenarios. In comparison, other models (even Qwen 32B) mix up the steps or items or interpret the instructions in abstract manner. Mistral Small is the most solid, but +10B would benefit it, I think.
View on Reddit #44034782

johnny1064@reddit

Maybe Mistral’s busy cooking. Unlike OpenAi who’s too busy talking, then gets their ass handed to them by Google. (Veo2)
View on Reddit #43990810

Spammesir@reddit

I get your point about SORA but o3's definitely good
View on Reddit #43993285

EquivalentIce7442@reddit

How do you know? They didn't release it..
View on Reddit #43993339

procgen@reddit

How do we know? The benchmarks results, obviously.
View on Reddit #44010262

Few_Painter_5588@reddit

Those benchmarks were flubbed by basically giving the model infinite time and resources to think.
View on Reddit #44023696

procgen@reddit

That's either a misunderstanding on your part or a blatant lie: https://arcprize.org/blog/oai-o3-pub-breakthrough Time per task was ~13 mins on the semi-private eval.
View on Reddit #44024544

EquivalentIce7442@reddit

Ahahah yeah we'll see when and if..
View on Reddit #44012558

procgen@reddit

What do you mean? Francois Chollet already confirmed it, lol.
View on Reddit #44012587

EquivalentIce7442@reddit

I'd like to try it then we'll see.. 
View on Reddit #44012646

procgen@reddit

The fact remains that no other model has come close on the ARC-AGI or frontier math benchmarks. The reason you can't use it now is because it's absurdly expensive to run, but the costs will drop fast.
View on Reddit #44012722

squareOfTwo@reddit

did you try it? The answer is no
View on Reddit #43993369

procgen@reddit

o3 is outperforming humans on ARC-AGI, lol. They have the most powerful research model that's been publicly revealed.
View on Reddit #44010299

fallingdowndizzyvr@reddit

I think they have to tread lightly and carefully with the new EU regs to worry about.
View on Reddit #44023923

candre23@reddit

Not really. They dropped a new version of the 22b in September. October was a new 8b. A month ago we got two new versions of largestral - with and without image support. I know this space moves fast, but going one whole month without a new model is hardly "sleeping".
View on Reddit #44002013

kif88@reddit

And that 3b ministral they keep behind API.
View on Reddit #44022992

ForsookComparison@reddit

dying for a new codestral
View on Reddit #44022760

pigeon57434@reddit

you know who has really been totally silent? Anthropic. I wonder what they will do Claude 3.5 was a fucking beast but they havent released the next gen models yet and are behind now
View on Reddit #44013568

Any_Elderberry_3985@reddit

They released pix large not long ago. They don't get much press anymore because there are other good models and they have no commercial use without licensing.
View on Reddit #44011780

silenceimpaired@reddit

Came here to say this. My interest in them died the moment they switched to a license like this… especially since their dataset is probably based off the work of others without their consent.
View on Reddit #44012176

Aurelien-Morgan@reddit

https://preview.redd.it/8by76fi2h79e1.png?width=454&format=png&auto=webp&s=7341afc394e2ed24c9dd541bd1fd8bf18cf238b1 Also, now :
View on Reddit #44011205

lolwutdo@reddit

Need a reasoning Mistral model
View on Reddit #44006814

Great_Currency_3998@reddit

Mistral is dead. Just look at who has money. Their top people left for better pay. Simple as that. Hence, they're braindead.
View on Reddit #43992641

CheatCodesOfLife@reddit

How is Mistral dead? They have the best open weights models (Mistral-Large-2411 and Pixtral-Large)
View on Reddit #44003366

Amgadoz@reddit

Mention 5 people who left mistral
View on Reddit #43992782

Illustrious-Lake2603@reddit

Really wishing for Codestral 2, a 7b parameter that outperforms Qwen Coder 2.5 32b. That would make Christmas complete
View on Reddit #43996705

Combinatorilliance@reddit

Codestral is amazing!
View on Reddit #43998887

DarKresnik@reddit

Come on Mistral, do it like OpenAI and Google. Copy Chinese models, make some changes and go...
View on Reddit #43990768

Nyghtbynger@reddit

Knowing the French, they will never lol
View on Reddit #43998648

darkplaceguy1@reddit

One month in irl, 1 year in AI terms.
View on Reddit #43996911

randomrealname@reddit

Have you used it recently? They have a pretty decent reson9ng model in the chat just now.
View on Reddit #43996695

RAGcontent@reddit

i can't find the post anymore but I saw that they were starting some sort of incubator and offering credits for startups
View on Reddit #43994891

mlon_eusk-_-@reddit

They are bringing subscription services, like chatgpt, so it is most likely that they will launch a new better model with subscription anyways
View on Reddit #43991535

Many_SuchCases@reddit

yeah, come on Mistral, we know you're reading this!
View on Reddit #43990475