TL;DR
MorphoHELM is a comprehensive benchmark for evaluating microscopy image representations, assessing robustness to noise and comparing various models, including deep learning and classic methods.
Contribution
It consolidates evaluation standards for Cell Painting assays, extends them for robustness, and provides a broad comparison of methods, revealing trade-offs and strengths.
Findings
No model outperforms classic methods across all settings.
Models excel at certain biological signals but are weaker at others.
Benchmark evaluates robustness to increasing technical noise.
Abstract
Microscopy images contain rich information about how cells respond to perturbations, making them essential to applications like drug screening. To quantify images, researchers often use representation extraction methods, and recent years have seen a proliferation of deep learning methods. While measuring the quality of these representations is essential, evaluation remains fragmented, with each proposed model evaluated on different tasks and datasets, using custom pipelines and metrics, making it difficult to fairly compare models. Here, we introduce MorphoHELM, a comprehensive open benchmark for evaluating feature extraction methods for Cell Painting, the most widely-used morphological profiling assay. MorphoHELM consolidates evaluation standards in the field, extends and corrects them to be more robust, and evaluates on the widest range of methods to date. A defining feature of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
