Fragility-aware Classification for Understanding Risk and Improving Generalization
Chen Yang, Zheng Cui, Daniel Zhuoyu Long, Jin Qi, Ruohan Zhan

TL;DR
This paper introduces the Fragility Index (FI), a new metric for evaluating classifiers' risk of confident misjudgments, and develops a training framework to minimize FI, enhancing robustness in safety-critical applications.
Contribution
The paper proposes the FI metric and a surrogate loss-based training method to directly optimize it, with theoretical bounds and extensions to deep neural networks.
Findings
FI reveals error tail risk not captured by accuracy or AUC.
Models trained to minimize FI reduce confident misjudgments in medical diagnosis tasks.
FI-based training maintains competitive accuracy while improving robustness.
Abstract
Classification models play a central role in data-driven decision-making applications such as medical diagnosis, recommendation systems, and risk assessment. Traditional performance metrics, such as accuracy and AUC, focus on overall error rates but fail to account for the confidence of incorrect predictions, i.e., the risk of confident misjudgments. This limitation is particularly consequential in safety-critical and cost-sensitive settings, where overconfident errors can lead to severe outcomes. To address this issue, we propose the Fragility Index (FI), a novel performance metric that evaluates classifiers from a risk-averse perspective by capturing the tail risk of confident misjudgments. We formulate FI within a robust satisficing (RS) framework to ensure robustness under distributional uncertainty. Building on this, we develop a tractable training framework that directly targets…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
