Sample-Optimal Density Estimation in Nearly-Linear Time
Jayadev Acharya, Ilias Diakonikolas, Jerry Li, Ludwig Schmidt

TL;DR
This paper introduces a nearly-linear time, sample-optimal algorithm for density estimation of univariate distributions approximated by piecewise polynomials, improving efficiency and accuracy over prior methods.
Contribution
The paper presents a unified, nearly-linear time algorithm for density estimation that is nearly optimal in sample complexity, applicable to various structured distribution families.
Findings
Achieves nearly optimal sample complexity for density estimation.
Runs in nearly-linear time, significantly faster than previous methods.
Performs well in practical experiments.
Abstract
We design a new, fast algorithm for agnostically learning univariate probability distributions whose densities are well approximated by piecewise polynomial functions. Let be the density function of an arbitrary univariate distribution, and suppose that is -close in -distance to an unknown piecewise polynomial function with interval pieces and degree . Our algorithm draws samples from , runs in time , and with probability at least outputs an -piecewise degree- hypothesis that is close to . Our general algorithm yields (nearly) sample-optimal and nearly-linear time estimators for a wide range of structured distribution families over both continuous and discrete domains in a unified way. For most of our applications, these are the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Machine Learning in Healthcare
