Robust improper maximum likelihood: tuning, computation, and a comparison with other methods for robust Gaussian clustering
Pietro Coretto, Christian Hennig

TL;DR
This paper introduces the OTRIMLE, a robust Gaussian clustering method that uses an improper density for outliers, and compares its performance with other clustering techniques through extensive simulations and real data applications.
Contribution
The paper presents the OTRIMLE, a novel robust clustering estimator that optimally tunes the improper density for improved outlier handling and compares it comprehensively with existing methods.
Findings
OTRIMLE achieves the most satisfactory overall performance in simulations.
All methods perform best in some setups, but OTRIMLE is most consistent.
The approach is effective on real datasets with and without known clusters.
Abstract
The two main topics of this paper are the introduction of the "optimally tuned improper maximum likelihood estimator" (OTRIMLE) for robust clustering based on the multivariate Gaussian model for clusters, and a comprehensive simulation study comparing the OTRIMLE to Maximum Likelihood in Gaussian mixtures with and without noise component, mixtures of t-distributions, and the TCLUST approach for trimmed clustering. The OTRIMLE uses an improper constant density for modelling outliers and noise. This can be chosen optimally so that the non-noise part of the data looks as close to a Gaussian mixture as possible. Some deviation from Gaussianity can be traded in for lowering the estimated noise proportion. Covariance matrix constraints and computation of the OTRIMLE are also treated. In the simulation study, all methods are confronted with setups in which their model assumptions are not…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
