GeneBench Benchmark Leaderboard

GeneBench is an evaluation focused on multi-stage scientific data analysis in genetics and quantitative biology. Tasks require reasoning about ambiguous or noisy data with minimal supervisory guidance, addressing realistic obstacles such as hidden confounders or QC failures, and correctly implementing and interpreting modern statistical methods.

Leaderboard

Top 2 models on GeneBench Benchmark Leaderboard (scores from public evaluations).

Models tracked

Models with genebench in their evaluation profile.

  • No models linked yet.

View task leaderboards →