TL;DR
This paper investigates the theoretical and empirical aspects of optimizing Dice and Jaccard metrics directly in medical image segmentation, revealing that surrogate losses are equivalent and that direct optimization is preferable over cross-entropy.
Contribution
It provides a theoretical analysis showing the equivalence of different surrogates and empirically demonstrates that surrogate choice does not significantly affect segmentation performance.
Findings
Surrogates are equivalent up to a multiplicative factor.
No optimal weighted cross-entropy exists for Dice or Jaccard.
Choice of surrogate does not statistically impact performance.
Abstract
The Dice score and Jaccard index are commonly used metrics for the evaluation of segmentation tasks in medical imaging. Convolutional neural networks trained for image segmentation tasks are usually optimized for (weighted) cross-entropy. This introduces an adverse discrepancy between the learning optimization objective (the loss) and the end target metric. Recent works in computer vision have proposed soft surrogates to alleviate this discrepancy and directly optimize the desired metric, either through relaxations (soft-Dice, soft-Jaccard) or submodular optimization (Lov\'asz-softmax). The aim of this study is two-fold. First, we investigate the theoretical differences in a risk minimization framework and question the existence of a weighted cross-entropy loss with weights theoretically optimized to surrogate Dice or Jaccard. Second, we empirically investigate the behavior of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
