Estimation of clusterwise linear regression models with a shrinkage-like approach
Roberto Di Mari, Roberto Rocci, Stefano Antonio Gattone

TL;DR
This paper introduces a data-driven constrained maximum likelihood estimation method for clusterwise linear regression that balances heteroscedastic and homoscedastic models, improving parameter estimation and classification accuracy.
Contribution
It extends equivariant data-driven estimation techniques to clusterwise linear regression, incorporating bounds on variances based on a data-derived target, with a shrinkage interpretation.
Findings
Method outperforms traditional approaches in simulations.
Provides better parameter estimates balancing heteroscedasticity and homoscedasticity.
Enhances classification accuracy in real-data applications.
Abstract
Constrained approaches to maximum likelihood estimation in the context of finite mixtures of normals have been presented in the literature. A fully data-dependent constrained method for maximum likelihood estimation of clusterwise linear regression is proposed, which extends previous work in equivariant data-driven estimation of finite mixtures of Gaussians for classification. The method imposes plausible bounds on the component variances, based on a target value estimated from the data, which we take to be the homoscedastic variance. Nevertheless, the present work does not only focus on classification recovery, but also on how well model parameters are estimated. In particular, the paper sheds light on the shrinkage-like interpretation of the procedure, where the target is the homoscedastic model: this is not only related to how close to the target the estimated scales are, but extends…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Advanced Clustering Algorithms Research · Statistical Methods and Bayesian Inference
