BioBench: A Blueprint to Move Beyond ImageNet for Scientific ML Benchmarks

Samuel Stevens

arXiv:2511.16315·cs.CV·November 21, 2025

BioBench: A Blueprint to Move Beyond ImageNet for Scientific ML Benchmarks

Samuel Stevens

PDF

Open Access

TL;DR

BioBench introduces a comprehensive ecology-focused vision benchmark with diverse tasks and modalities, addressing the limitations of ImageNet in predicting scientific imagery performance and enabling more reliable AI-for-science evaluations.

Contribution

BioBench provides a unified, multi-task ecology vision benchmark with 9 tasks, 4 taxonomic kingdoms, and 6 modalities, offering a new standard for scientific machine learning evaluation.

Findings

01

ImageNet accuracy explains only 34% of variance on ecology tasks.

02

BioBench's diverse tasks better predict scientific imagery performance.

03

ViT-L models evaluate efficiently on BioBench, enabling scalable benchmarking.

Abstract

ImageNet-1K linear-probe transfer accuracy remains the default proxy for visual representation quality, yet it no longer predicts performance on scientific imagery. Across 46 modern vision model checkpoints, ImageNet top-1 accuracy explains only 34% of variance on ecology tasks and mis-ranks 30% of models above 75% accuracy. We present BioBench, an open ecology vision benchmark that captures what ImageNet misses. BioBench unifies 9 publicly released, application-driven tasks, 4 taxonomic kingdoms, and 6 acquisition modalities (drone RGB, web video, micrographs, in-situ and specimen photos, camera-trap frames), totaling 3.1M images. A single Python API downloads data, fits lightweight classifiers to frozen backbones, and reports class-balanced macro-F1 (plus domain metrics for FishNet and FungiCLEF); ViT-L models evaluate in 6 hours on an A6000 GPU. BioBench provides new signal for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCell Image Analysis Techniques · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning