Effective sample size: a measure of individual uncertainty in predictions
Doranne Thomassen, Saskia le Cessie, Hans van Houwelingen, Ewout, Steyerberg

TL;DR
This paper introduces the effective sample size as an intuitive, clinically interpretable measure of individual prediction uncertainty in clinical models, derived analytically for generalized linear models and illustrated with myocardial infarction data.
Contribution
It develops an analytical framework for calculating effective sample size to quantify individual prediction uncertainty, enhancing model development, validation, and clinical communication.
Findings
Effective sample size can quantify individual prediction uncertainty.
The measure helps identify patients with less reliable predictions.
Application to myocardial infarction data demonstrates practical utility.
Abstract
Clinical prediction models are estimated using a sample of limited size from the target population, leading to uncertainty in predictions, even when the model is correctly specified. Generally, not all patient profiles are observed uniformly in model development. As a result, sampling uncertainty varies between individual patients' predictions. We aimed to develop an intuitive measure of individual prediction uncertainty. The variance of a patient's prediction can be equated to the variance of the sample mean outcome in n* hypothetical patients with the same predictor values. This hypothetical sample size n* can be interpreted as the number of similar patients n_eff that the prediction is effectively based on, given that the model is correct. For generalised linear models, we derived analytical expressions for the effective sample size. In addition, we illustrated the concept in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Health Systems, Economic Evaluations, Quality of Life · Healthcare cost, quality, practices
