Methods and open-source toolkit for analyzing and visualizing challenge results
Manuel Wiesenfarth, Annika Reinke, Bennett A. Landman, Manuel Jorge, Cardoso, Lena Maier-Hein, Annette Kopp-Schneider

TL;DR
This paper introduces a set of methods and an open-source toolkit for analyzing and visualizing challenge results in biomedical image analysis, addressing gaps in uncertainty visualization and comprehensive performance assessment.
Contribution
The paper presents novel methods and releases challengeR, an open-source framework for improved analysis and visualization of challenge outcomes in biomedical imaging.
Findings
Enhanced visualization of challenge results with uncertainty representation
Application to real and simulated challenges demonstrates method effectiveness
Open-source toolkit facilitates widespread adoption and improved analysis
Abstract
Biomedical challenges have become the de facto standard for benchmarking biomedical image analysis algorithms. While the number of challenges is steadily increasing, surprisingly little effort has been invested in ensuring high quality design, execution and reporting for these international competitions. Specifically, results analysis and visualization in the event of uncertainties have been given almost no attention in the literature. Given these shortcomings, the contribution of this paper is two-fold: (1) We present a set of methods to comprehensively analyze and visualize the results of single-task and multi-task challenges and apply them to a number of simulated and real-life challenges to demonstrate their specific strengths and weaknesses; (2) We release the open-source framework challengeR as part of this work to enable fast and wide adoption of the methodology proposed in this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
