On the number of modes of Gaussian kernel density estimators
Borjan Geshkovski, Philippe Rigollet, Yihang Sun

TL;DR
This paper analyzes how the expected number of modes in Gaussian kernel density estimators scales with bandwidth and sample size, providing asymptotic results using advanced mathematical tools.
Contribution
It establishes the asymptotic scaling law for the expected number of modes of Gaussian kernel density estimators on the real line.
Findings
Expected number of modes scales as Θ(√(β log β))
Results hold for bandwidth and sample size in specified ranges
Uses Kac-Rice formula and Edgeworth expansion
Abstract
We consider the Gaussian kernel density estimator with bandwidth of iid Gaussian samples. Using the Kac-Rice formula and an Edgeworth expansion, we prove that the expected number of modes on the real line scales as as provided for some constant . An impetus behind this investigation is to determine the number of clusters to which Transformers are drawn in a metastable state.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic processes and statistical mechanics · Bayesian Methods and Mixture Models · Stochastic processes and financial applications
