MCP Atlas Benchmark Leaderboard

MCP Atlas is a benchmark for evaluating AI models on scaled tool use capabilities, measuring how well models can coordinate and utilize multiple tools across complex multi-step tasks.

Leaderboard

Top 18 models on MCP Atlas Benchmark Leaderboard (scores from public evaluations).

Models tracked

Models with mcp-atlas in their evaluation profile.

  • No models linked yet.

View task leaderboards →