Gemini Exp 1114 now ranks joint #1 overall on Chatbot Arena (that name though....)

Posted by lightdreamscape@reddit | LocalLLaMA

Massive News from Chatbot Arena

u/GoogleDeepMind's latest Gemini (Exp 1114), tested with 6K+ community votes over the past week, now ranks joint #1 overall with an impressive 40+ point score leap, matching 4o-latest and surpassing o1-preview! It also claims #1 on the Vision leaderboard.
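For a rough sense of what a 40-point gap means in head-to-head votes, here is a minimal sketch. It assumes Arena scores behave like standard Elo ratings (base 10, scale 400); the announcement does not say this, so treat it as an assumption. The function name `expected_win_rate` is made up for illustration.

```python
def expected_win_rate(score_gap: float) -> float:
    """Expected win probability for the higher-rated model.

    Assumption: scores follow the classic Elo formula
    (base 10, scale 400). This is not stated in the announcement.
    """
    return 1.0 / (1.0 + 10.0 ** (-score_gap / 400.0))


if __name__ == "__main__":
    # 40 is the "40+ score leap" from the post; everything else is illustrative.
    gap = 40
    print(f"Gap of {gap} points -> expected win rate {expected_win_rate(gap):.1%}")
```

Under that assumption, a 40-point lead works out to winning roughly 56% of pairwise votes, so the jump is real but far from a blowout.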

Gemini-Exp-1114 excels across technical and creative domains:

- Overall: #3 -> #1
- Math: #3 -> #1
- Hard Prompts: #4 -> #1
- Creative Writing: #2 -> #1
- Vision: #2 -> #1
- Coding: #5 -> #3
- Overall (StyleCtrl): #4 -> #4

Huge congrats to @GoogleDeepMind on this remarkable milestone!

Check out the original thread:

https://x.com/lmarena_ai/status/1857110672565494098?t=RdIOf2TycklRpHsH-9nl_w&s=07&fbclid=IwZXh0bgNhZW0CMTEAAR2twWnQtHrXI_6zt-cbVKRvC8VuTHMVsPT5M1lFUIeHQ49yaBAb-KUvfqk_aem_Gx6TX3uaCoKDTtc34NCpfg