Robust Clustering with Normal Mixture Models: A Pseudo $\beta$-Likelihood Approach
Soumya Chakraborty, Ayanendranath Basu, Abhik Ghosh

TL;DR
This paper introduces a robust clustering method for normal mixture models using a pseudo β-likelihood approach, improving outlier detection and cluster accuracy in complex, overlapping data scenarios.
Contribution
It proposes a novel robust estimation technique based on density power divergence, with an iterative reweighted least squares algorithm for simultaneous clustering and anomaly detection.
Findings
Performs better than existing methods in overlapping clusters
Effective in detecting outliers and anomalies
Maintains low misclassification rates in real datasets
Abstract
As in other estimation scenarios, likelihood based estimation in the normal mixture set-up is highly non-robust against model misspecification and presence of outliers (apart from being an ill-posed optimization problem). A robust alternative to the ordinary likelihood approach for this estimation problem is proposed which performs simultaneous estimation and data clustering and leads to subsequent anomaly detection. To invoke robustness, the methodology based on the minimization of the density power divergence (or alternatively, the maximization of the -likelihood) is utilized under suitable constraints. An iteratively reweighted least squares approach has been followed in order to compute the proposed estimators for the component means (or equivalently cluster centers) and component dispersion matrices which leads to simultaneous data clustering. Some exploratory techniques are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models · Bayesian Methods and Mixture Models · Statistical Distribution Estimation and Applications
