A decomposition of Fisher's information to inform sample size for developing or updating fair and precise clinical prediction models -- Part 3: continuous outcomes
Rebecca Whittle, Richard D Riley, Lucinda Archer, Gary S Collins, Amardeep Legha, Kym IE Snell, Joie Ensor

TL;DR
This paper introduces a methodology based on Fisher's information to determine the sample size needed for developing or updating clinical prediction models with continuous outcomes, emphasizing prediction precision and fairness across subgroups.
Contribution
It presents a novel approach that links sample size to prediction uncertainty and fairness, extending existing guidelines to explicitly address prediction precision and subgroup equity.
Findings
Method calculates sample size for desired prediction confidence intervals.
Approach evaluates existing dataset adequacy for model development.
Extensions include addressing aleatoric uncertainty in predictions.
Abstract
Clinical prediction models enable healthcare professionals to estimate individual outcomes using patient characteristics. Current sample size guidelines for developing or updating models with continuous outcomes aim to minimise overfitting and ensure accurate estimation of population-level parameters, but do not explicitly address the precision of predictions. This is a critical limitation, as wide confidence intervals around predictions can undermine clinical utility and fairness, particularly if precision varies across subgroups. We propose methodology for calculating the sample size required to ensure precise and fair predictions in models with continuous outcomes. Building on linear regression theory and the Fisher's unit information matrix, our approach calculates how sample size impacts the epistemic (model-based) uncertainty of predictions and allows researchers to either (i)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods in Clinical Trials · Statistical Methods and Inference · Health Systems, Economic Evaluations, Quality of Life
