Fast rates for noisy clustering
S\'ebastien Loustau (LAREMA)

TL;DR
This paper establishes fast convergence rates for noisy clustering by analyzing the impact of measurement errors and deriving bounds that depend on the regularity of the data density and the noise level.
Contribution
It introduces a general upper bound for empirical minimization with noisy data using deconvolution kernels and applies it to derive convergence rates in noisy clustering.
Findings
Achieves a convergence rate of (1/n^{rac{\u03b3}{ 7+ 7})}) depending on data regularity and noise.
Extends existing bounds to the noisy setting, showing how noise affects convergence rates.
Provides theoretical insights into the impact of measurement errors on clustering performance.
Abstract
The effect of errors in variables in empirical minimization is investigated. Given a loss and a set of decision rules , we prove a general upper bound for an empirical minimization based on a deconvolution kernel and a noisy sample . We apply this general upper bound to give the rate of convergence for the expected excess risk in noisy clustering. A recent bound from \citet{levrard} proves that this rate is in the direct case, under Pollard's regularity assumptions. Here the effect of noisy measurements gives a rate of the form , where is the H\"older regularity of the density of whereas is the degree of illposedness.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Machine Learning and Algorithms · Bayesian Methods and Mixture Models
