TheaterFire

What’s the smartest NON thinking model under 40B or so?

Posted by Borkato@reddit | LocalLLaMA | View on Reddit | 14 comments

Seed 39B is excellent for thinking, but what about non-thinking?

Reply to Post

14 Comments

Long_comment_san@reddit

I'd vote for Magistral and Mistral series in general. They seem to be trained outstandingly well. I wish they made a 40-50b MOE model so I can just stop talking to people permanently.
View on Reddit #69837070

vasileer@reddit

gpt-oss-20b
View on Reddit #69590025

elbiot@reddit

That's a thinking model
View on Reddit #69710395

Brave-Hold-9389@reddit

Choose one from https://huggingface.co/spaces/flowers-team/StickToYourRoleLeaderboard Or this https://huggingface.co/spaces/DontPlanToEnd/UGI-Leaderboard Or one that appears in both
View on Reddit #69630188

Borkato@reddit (OP)

Wow, I haven’t seen that first one before, thank you!
View on Reddit #69665960

BigDry3037@reddit

Granite 4 small
View on Reddit #69561944

Badger-Purple@reddit

🤣🤣🤣
View on Reddit #69627369

BigDry3037@reddit

I’m assuming your emojis mean you disagree, but in all my evals granite 4 has been a real champion, what has your experience been?
View on Reddit #69628019

Klutzy-Snow8016@reddit

Maybe Qwen 3 VL 32B Instruct.
View on Reddit #69568950

TechNerd10191@reddit

I have no complains for the Gemm3 series.
View on Reddit #69558446

Demortus@reddit

I like them, but they do tend to halucinate fairly frequently, in my experience.
View on Reddit #69566632

sine120@reddit

Qwen3-30B-A3B-Instruct-2507 is my favorite.
View on Reddit #69559863

spliznork@reddit

Just curious, if you like Seed 36B, you can set its thinking budget to 0 to disable thinking. Is your hypothesis that explicit non-thinking models may do better than this? Or, what are you looking for?
View on Reddit #69557082

Borkato@reddit (OP)

Can I do this in ooba and sillytavern? If so how wtf
View on Reddit #69557378