Addressing patient heterogeneity in disease predictive model development
Xu Gao, Weining Shen, Jing Ning, Ziding Feng, Jianhua Hu

TL;DR
This paper introduces a hypothesis testing framework using mixture models and EM algorithms to detect and determine patient subgroups in biomedical prediction models, enhancing personalized medicine.
Contribution
It proposes a novel statistical test for identifying patient heterogeneity and subgroup structure, with computational efficiency and practical validation in cancer data.
Findings
Test maintains appropriate type-I error rates.
Test demonstrates high power in simulations.
Method successfully applied to prostate cancer data.
Abstract
This paper addresses patient heterogeneity associated with prediction problems in biomedical applications. We propose a systematic hypothesis testing approach to determine the existence of patient subgroup structure and the number of subgroups in patient population if subgroups exist. A mixture of generalized linear models is considered to model the relationship between the disease outcome and patient characteristics and clinical factors, including targeted biomarker profiles. We construct a test statistic based on expectation maximization (EM) algorithm and derive its asymptotic distribution under the null hypothesis. An important computational advantage of the test is that the involved parameter estimates under the complex alternative hypothesis can be obtained through a small number of EM iterations, rather than optimizing the objective function. We demonstrate the finite sample…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Statistical Methods in Clinical Trials · Bayesian Methods and Mixture Models
