TL;DR
This paper introduces the Kernel MRCD, a robust outlier detection method that works with non-elliptical data and high-dimensional datasets by leveraging kernel methods for improved flexibility and computational efficiency.
Contribution
The paper proposes the Kernel MRCD estimator, extending the MRCD to non-elliptical data and enhancing computational speed for high-dimensional datasets using kernel techniques.
Findings
Performs well in simulation studies
Effective in real-life data applications
Handles high-dimensional, non-elliptical data
Abstract
The minimum regularized covariance determinant method (MRCD) is a robust estimator for multivariate location and scatter, which detects outliers by fitting a robust covariance matrix to the data. Its regularization ensures that the covariance matrix is well-conditioned in any dimension. The MRCD assumes that the non-outlying observations are roughly elliptically distributed, but many datasets are not of that form. Moreover, the computation time of MRCD increases substantially when the number of variables goes up, and nowadays datasets with many variables are common. The proposed Kernel Minimum Regularized Covariance Determinant (KMRCD) estimator addresses both issues. It is not restricted to elliptical data because it implicitly computes the MRCD estimates in a kernel induced feature space. A fast algorithm is constructed that starts from kernel-based initial estimates and exploits the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
