{\mu}-Bench: A Vision-Language Benchmark for Microscopy Understanding
Alejandro Lozano, Jeffrey Nirschl, James Burgess, Sanket Rajan Gupte,, Yuhui Zhang, Alyssa Unell, Serena Yeung-Levy

TL;DR
{b0}-Bench is a comprehensive, expert-curated benchmark for evaluating vision-language models in microscopy, revealing current limitations and proposing solutions to improve biomedical image understanding.
Contribution
The paper introduces {b0}-Bench, the first large-scale, diverse microscopy vision-language benchmark, and evaluates models, highlighting challenges and potential strategies like weight interpolation.
Findings
Current models perform poorly across microscopy tasks.
Specialist models often underperform compared to generalist models.
Fine-tuning can cause catastrophic forgetting of prior knowledge.
Abstract
Recent advances in microscopy have enabled the rapid generation of terabytes of image data in cell biology and biomedical research. Vision-language models (VLMs) offer a promising solution for large-scale biological image analysis, enhancing researchers' efficiency, identifying new image biomarkers, and accelerating hypothesis generation and scientific discovery. However, there is a lack of standardized, diverse, and large-scale vision-language benchmarks to evaluate VLMs' perception and cognition capabilities in biological image understanding. To address this gap, we introduce {\mu}-Bench, an expert-curated benchmark encompassing 22 biomedical tasks across various scientific disciplines (biology, pathology), microscopy modalities (electron, fluorescence, light), scales (subcellular, cellular, tissue), and organisms in both normal and abnormal states. We evaluate state-of-the-art…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCell Image Analysis Techniques · Machine Learning in Materials Science
MethodsBalanced Selection
