Analyzing covariate clustering effects in healthcare cost subgroups: insights and applications for prediction
Zhengxiao Li, Yifan Huang, Yang Cao

TL;DR
This paper introduces a novel covariate clustering framework within finite mixture regression models to improve healthcare cost prediction, effectively handling high-dimensional, correlated, and complex cost data.
Contribution
It presents a new convex optimization approach with a specialized EM-ADMM algorithm for covariate clustering, enhancing prediction accuracy over traditional methods.
Findings
The framework accurately captures complex covariate relationships.
Simulation studies confirm convergence and efficiency.
Real data applications demonstrate improved prediction performance.
Abstract
Healthcare cost prediction is a challenging task due to the high-dimensionality and high correlation among covariates. Additionally, the skewed, heavy-tailed, and often multi-modal nature of cost data can complicate matters further due to unobserved heterogeneity. In this study, we propose a novel framework for finite mixture regression models that incorporates covariate clustering methods to better account for the effects of clustered covariates on subgroups of the outcome, which enables a more accurate characterization of the complex distribution of the data. The proposed framework can be formulated as a convex optimization problem with an additional penalty term based on the prior similarity of the covariates. To efficiently solve this optimization problem, a specialized EM-ADMM algorithm is proposed that integrates the alternating direction multiplicative method (ADMM) into the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Bayesian Methods and Mixture Models · Statistical Methods and Bayesian Inference
