Metrics reloaded: Recommendations for image analysis validation
Lena Maier-Hein, Annika Reinke, Patrick Godau, Minu D. Tizabi, Florian, Buettner, Evangelia Christodoulou, Ben Glocker, Fabian Isensee, Jens, Kleesiek, Michal Kozubek, Mauricio Reyes, Michael A. Riegler, Manuel, Wiesenfarth, A. Emre Kavur, Carole H. Sudre, Michael Baumgartner

TL;DR
Metrics Reloaded is a comprehensive framework and online tool designed to improve validation metric selection in biomedical image analysis, addressing current flaws and promoting domain-aware, problem-specific evaluation practices.
Contribution
It introduces a structured problem fingerprint concept and a multi-stage Delphi process to guide the selection of appropriate validation metrics in image analysis.
Findings
Framework applicable across various biomedical image analysis tasks
Online tool facilitates user-friendly metric selection
Demonstrated effectiveness in multiple biological and medical use cases
Abstract
Increasing evidence shows that flaws in machine learning (ML) algorithm validation are an underestimated global problem. Particularly in automatic biomedical image analysis, chosen performance metrics often do not reflect the domain interest, thus failing to adequately measure scientific progress and hindering translation of ML techniques into practice. To overcome this, our large international expert consortium created Metrics Reloaded, a comprehensive framework guiding researchers in the problem-aware selection of metrics. Following the convergence of ML methodology across application domains, Metrics Reloaded fosters the convergence of validation methodology. The framework was developed in a multi-stage Delphi process and is based on the novel concept of a problem fingerprint - a structured representation of the given problem that captures all aspects that are relevant for metric…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Explainable Artificial Intelligence (XAI) · Machine Learning in Materials Science
MethodsAttentive Walk-Aggregating Graph Neural Network
