Gemma 4 31B > Kimi K2.5 > Grok 4.20 on DuelLab's highest reasoning leaderboard

Posted by Goa_@reddit | LocalLLaMA

Gemma 4 31B: 53.9 score

Kimi K2.5: 50.5 score

Grok 4.20: 46.8 score

Funny to see the open Gemma 4 31B ahead of both.
Note that these scores are about writing competitive code...