Deep ROC Analysis and AUC as Balanced Average Accuracy to Improve Model   Selection, Understanding and Interpretation

Andr\'e M. Carrington; Douglas G. Manuel; Paul W. Fieguth; Tim Ramsay,; Venet Osmani; Bernhard Wernly; Carol Bennett; Steven Hawken; Matthew McInnes,; Olivia Magwood; Yusuf Sheikh; Andreas Holzinger

arXiv:2103.11357·stat.ME·January 28, 2022·22 cites

Deep ROC Analysis and AUC as Balanced Average Accuracy to Improve Model Selection, Understanding and Interpretation

Andr\'e M. Carrington, Douglas G. Manuel, Paul W. Fieguth, Tim Ramsay,, Venet Osmani, Bernhard Wernly, Carol Bennett, Steven Hawken, Matthew McInnes,, Olivia Magwood, Yusuf Sheikh, Andreas Holzinger

PDF

Open Access

TL;DR

This paper introduces deep ROC analysis, a method that evaluates groups of predicted risks to improve model selection and interpretation, translating complex measures into familiar accuracy metrics.

Contribution

It proposes a novel deep ROC analysis approach that interprets AUC and partial AUC as balanced average accuracy, enhancing model evaluation and understanding.

Findings

01

Normalized partial AUC equals average sensitivity.

02

Normalized horizontal partial AUC equals average specificity.

03

Deep ROC analysis improves model selection in case studies.

Abstract

Optimal performance is critical for decision-making tasks from medicine to autonomous driving, however common performance measures may be too general or too specific. For binary classifiers, diagnostic tests or prognosis at a timepoint, measures such as the area under the receiver operating characteristic curve, or the area under the precision recall curve, are too general because they include unrealistic decision thresholds. On the other hand, measures such as accuracy, sensitivity or the F1 score are measures at a single threshold that reflect an individual single probability or predicted risk, rather than a range of individuals or risk. We propose a method in between, deep ROC analysis, that examines groups of probabilities or predicted risks for more insightful analysis. We translate esoteric measures into familiar terms: AUC and the normalized concordant partial AUC are balanced…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Healthcare · Artificial Intelligence in Healthcare · Explainable Artificial Intelligence (XAI)