Leaderboards

Compare models
8 leaderboards5 categories27 ranked entries

coding

general

math

#ModelScore
1o183.3%
#ModelScore
1o194.8%
2Llama 3.3 70B77.0%
3Mistral Large 276.9%
4GPT-4o76.6%

multimodal

reasoning