GDPval-MM Benchmark Leaderboard

GDPval-MM is the multimodal variant of the GDPval benchmark. It evaluates AI model performance on real-world, economically valuable tasks that require processing and generating multimodal content, including documents, slides, diagrams, spreadsheets, images, and other professional deliverables, across diverse industries.

Leaderboard

Top 3 models on the GDPval-MM Benchmark Leaderboard (scores from public evaluations).

Models tracked

Models with gdpval-mm in their evaluation profile.

  • No models linked yet.
