Common Limitations of Image Processing Metrics: A Picture Story
Annika Reinke, Minu D. Tizabi, Carole H. Sudre, Matthias Eisenmann,, Tim R\"adsch, Michael Baumgartner, Laura Acion, Michela Antonelli, Tal Arbel,, Spyridon Bakas, Peter Bankhead, Arriel Benis, Matthew Blaschko, Florian, Buettner, M. Jorge Cardoso, Jianxu Chen

TL;DR
This paper critically examines common limitations of image processing metrics in biomedical image analysis, highlighting issues related to metric properties, dataset characteristics, and domain relevance, based on expert consensus.
Contribution
It provides a comprehensive analysis of practical pitfalls in using performance metrics for biomedical image analysis, emphasizing the need for careful metric selection and interpretation.
Findings
Metrics often ignore class imbalance effects
Test case non-independence impacts metric validity
Metrics may not reflect biomedical domain priorities
Abstract
While the importance of automatic image analysis is continuously increasing, recent meta-research revealed major flaws with respect to algorithm validation. Performance metrics are particularly key for meaningful, objective, and transparent performance assessment and validation of the used automatic algorithms, but relatively little attention has been given to the practical pitfalls when using specific metrics for a given image analysis task. These are typically related to (1) the disregard of inherent metric properties, such as the behaviour in the presence of class imbalance or small target structures, (2) the disregard of inherent data set properties, such as the non-independence of the test cases, and (3) the disregard of the actual biomedical domain interest that the metrics should reflect. This living dynamically document has the purpose to illustrate important limitations of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Image Segmentation Techniques · Advanced Neural Network Applications · Image Retrieval and Classification Techniques
