Weighted Brier Score -- an Overall Summary Measure for Risk Prediction Models with Clinical Utility Consideration
Kehao Zhu, Yingye Zheng, Kwun Chuen Gary Chan

TL;DR
This paper introduces a weighted Brier score tailored to evaluate the clinical utility of risk prediction models, decomposing it into discrimination and calibration, and demonstrating its application with prostate cancer data.
Contribution
It proposes a novel weighted Brier score aligned with clinical utility, linking it to the $H$ measure and providing a comprehensive evaluation framework for risk models.
Findings
Weighted Brier score effectively assesses clinical utility.
Decomposition reveals contributions of discrimination and calibration.
Application to prostate cancer data demonstrates practical utility.
Abstract
As advancements in novel biomarker-based algorithms and models accelerate disease risk prediction and stratification in medicine, it is crucial to evaluate these models within the context of their intended clinical application. Prediction models output the absolute risk of disease; subsequently, patient counseling and shared decision-making are based on the estimated individual risk and cost-benefit assessment. The overall impact of the application is often referred to as clinical utility, which received significant attention in terms of model assessment lately. The classic Brier score is a popular measure of prediction accuracy; however, it is insufficient for effectively assessing clinical utility. To address this limitation, we propose a class of weighted Brier scores that aligns with the decision-theoretic framework of clinical utility. Additionally, we decompose the weighted Brier…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMeta-analysis and systematic reviews
