Overcoming computational inability to predict clinical outcome from high-dimensional patient data using Bayesian methods
A Shalabi, A C C Coolen, and E de Rinaldis

TL;DR
This paper introduces a Bayesian prediction method for high-dimensional clinical data that overcomes computational challenges and reduces overfitting, enabling accurate outcome prediction with arbitrary data dimensions.
Contribution
The authors develop an exact analytical Bayesian approach that simplifies high-dimensional integrals, making outcome prediction feasible and efficient for large d, unlike previous methods.
Findings
Performs as well or better than mclustDA in low dimensions
Remains computationally feasible for very high dimensions
Reduces integral dimensions from 2d to 4, then 3 for large d
Abstract
Clinical outcome prediction from high-dimensional data is problematic in the common setting where there is only a relatively small number of samples. The imbalance causes data overfitting, and outcome prediction becomes computationally expensive or even impossible. We propose a Bayesian outcome prediction method that can be applied to data of arbitrary dimension d, from 2 outcome classes, and reduces overfitting without any approximations at parameter level. This is achieved by avoiding numerical integration or approximation, and solving the Bayesian integrals analytically. We thereby reduce the dimension of numerical integrals from 2d dimensions to 4, for any d. For large d, this is reduced further to 3, and we obtain a simple outcome prediction formula without integrals in leading order for very large d. We compare our method to the mclustDA method (Fraley and Raftery 2002), using…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Statistical Methods and Inference · Gaussian Processes and Bayesian Inference
