MMMU

Category: multimodal
Score unit: %
Higher is better: yes

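MMMU (Massive Multi-discipline Multimodal Understanding and Reasoning) measures college-level reasoning on questions that combine images and text. Scores are reported as a percentage.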

Leaderboard

#  Model              Provider         Score (%)  Evaluated  Source
1  Claude Opus 4.7    Anthropic        76.1       —
2  Gemini 2.0 Flash   Google DeepMind  71.7       —
3  Claude 3.5 Sonnet  Anthropic        70.4       —
4  GPT-4o             OpenAI           69.1       —
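
The ordering of the leaderboard follows from the "higher is better" setting: entries are sorted by score, highest first. A minimal Python sketch of that ranking is shown here, using the four scores listed above. The names `Result`, `RESULTS`, and `rank` are hypothetical and are not part of any Wikibench API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Result:
    """One leaderboard entry (hypothetical structure for illustration)."""
    model: str
    provider: str
    score: float  # MMMU score in percent


# Scores copied from the leaderboard above, deliberately out of order.
RESULTS = [
    Result("GPT-4o", "OpenAI", 69.1),
    Result("Claude Opus 4.7", "Anthropic", 76.1),
    Result("Claude 3.5 Sonnet", "Anthropic", 70.4),
    Result("Gemini 2.0 Flash", "Google DeepMind", 71.7),
]


def rank(results, higher_is_better=True):
    """Return (position, result) pairs, best entry first."""
    ordered = sorted(results, key=lambda r: r.score, reverse=higher_is_better)
    return list(enumerate(ordered, start=1))


if __name__ == "__main__":
    for position, r in rank(RESULTS):
        print(f"{position}. {r.model} ({r.provider}): {r.score:.1f}%")
```

For a benchmark where lower scores are better, the same helper could be called with `higher_is_better=False`.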

Wikibench: community-edited AI benchmark data. Content licensed CC BY-SA 4.0.