Academic or research source. Check the methodology, sample size, and whether it's been replicated.
Chatbot Arena Elo Rankings — Top 20 Models
#1 Gemini-2.5-Pro (Score: 1) #2 Gemini-2.5-Pro-Preview-05-06 (Score: 2) #3 GLM-4.5 (Score: 2) #4 Grok-4-0709 (Score: 2) #5 ChatGPT-4o-latest (2025-03-26) (Score: 3) #6 o3-2025-04-16 (Score: 3) #7...
LMArena Elo Rankings
·
·
~4 min read
2-Minute Brief
- According to LMArena Elo Rankings: #1 Gemini-2.5-Pro (Score: 1) #2 Gemini-2.5-Pro-Preview-05-06 (Score: 2) #3 GLM-4.5 (Score: 2) #4 Grok-4-0709 (Score: 2) #5 ChatGPT-4o-latest (2025-03-26) (Score: 3) #6 o3-2025-04-16 (Score: 3) #7 Qwen3-235B-A22B-Instruct-2507 (Score: 3) #8 DeepSeek-R1-0528 (Score: 3) #9 Grok-3-Preview-02-24 (Score: 4) #10 Llama-4-Maverick-03-26-Experimental (Score: 8) #11 GPT-4.5-Preview (Score: 8) #12 Qwen3-235B-A22B-Thinking-2507 (Score: 7) #13 chocolate (Early Grok-3) (Score: 8) #14 Gemini-2.5-Flash (Score: 1
8-Minute Deep Dive
Chatbot Arena Elo Rankings — Top 20 Models
TLDR
#1 Gemini-2.5-Pro (Score: 1) #2 Gemini-2.5-Pro-Preview-05-06 (Score: 2) #3 GLM-4.5 (Score: 2) #4 Grok-4-0709 (Score: 2) #5 ChatGPT-4o-latest (2025-03-26) (Score: 3) #6 o3-2025-04-16 (Score: 3) #7...
2-Minute Brief
- According to LMArena Elo Rankings: #1 Gemini-2.5-Pro (Score: 1) #2 Gemini-2.5-Pro-Preview-05-06 (Score: 2) #3 GLM-4.5 (Score: 2) #4 Grok-4-0709 (Score: 2) #5 ChatGPT-4o-latest (2025-03-26) (Score: 3) #6 o3-2025-04-16 (Score: 3) #7 Qwen3-235B-A22B-Instruct-2507 (Score: 3) #8 DeepSeek-R1-0528 (Score: 3) #9 Grok-3-Preview-02-24 (Score: 4) #10 Llama-4-Maverick-03-26-Experimental (Score: 8) #11 GPT-4.5-Preview (Score: 8) #12 Qwen3-235B-A22B-Thinking-2507 (Score: 7) #13 chocolate (Early Grok-3) (Score: 8) #14 Gemini-2.5-Flash (Score: 1
8-Minute Deep Dive
O open
S save
B back
M mode