Is the new model better? One metric says yes, but the other says no. Which metric do I use?
Qian M. Zhou, Zhe Lu, Russell J. Brooke, Melissa M Hudson, Yan Yuan

TL;DR
This paper compares two incremental value metrics, IncV-AUC and IncV-AP, analyzing their differences, relationships, and implications for model evaluation, especially when metrics conflict in medical risk prediction.
Contribution
It provides an analytical comparison of IncV-AUC and IncV-AP, revealing their weighting schemes and relationships with proper scoring rules, aiding better metric selection.
Findings
IncV-AP emphasizes high-risk group changes, IncV-AUC weights all risk groups equally.
IncV-AP and IncV-sBrS are highly consistent, IncV-AUC shows negative correlation with them.
Differences between metrics increase as the event rate decreases.
Abstract
Incremental value (IncV) evaluates the performance change from an existing risk model to a new model. It is one of the key considerations in deciding whether a new risk model performs better than the existing one. Problems arise when different IncV metrics contradict each other. For example, compared with a prescribed-dose model, an ovarian-dose model for predicting acute ovarian failure has a slightly lower area under the receiver operating characteristic curve (AUC) but increases the area under the precision-recall curve (AP) by 48%. This phenomenon of conflicting conclusions is not uncommon, and it creates a dilemma in medical decision making. In this article, we examine the analytical connections and differences between two IncV metrics: IncV in AUC (IncV-AUC) and IncV in AP (IncV-AP). Additionally, since they are both semi-proper scoring rules, we compare them with a strictly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDiet and metabolism studies · Mitochondrial Function and Pathology · Liver Disease Diagnosis and Treatment
