Local Component Analysis
Nicolas Le Roux (INRIA Paris - Rocquencourt, LIENS), Francis Bach, (INRIA Paris - Rocquencourt, LIENS)

TL;DR
This paper introduces a method to learn full Euclidean metrics for kernel density estimation using EM, improving density estimation and unsupervised learning tasks like clustering and manifold learning.
Contribution
It proposes a novel EM-based approach to learn full metrics in kernel density estimation, including semi-parametric models, with closed-form updates and scalability enhancements.
Findings
Higher test-likelihoods than competing methods
Metrics improve spectral clustering and manifold learning
Effective in large-scale settings
Abstract
Kernel density estimation, a.k.a. Parzen windows, is a popular density estimation method, which can be used for outlier detection or clustering. With multivariate data, its performance is heavily reliant on the metric used within the kernel. Most earlier work has focused on learning only the bandwidth of the kernel (i.e., a scalar multiplicative factor). In this paper, we propose to learn a full Euclidean metric through an expectation-minimization (EM) procedure, which can be seen as an unsupervised counterpart to neighbourhood component analysis (NCA). In order to avoid overfitting with a fully nonparametric density estimator in high dimensions, we also consider a semi-parametric Gaussian-Parzen density model, where some of the variables are modelled through a jointly Gaussian density, while others are modelled through Parzen windows. For these two models, EM leads to simple…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Fault Detection and Control Systems · Statistical Methods and Inference
