Estimating location parameters in entangled single-sample distributions
Ankit Pensia, Varun Jog, Po-Ling Loh

TL;DR
This paper introduces a novel adaptive estimator for the common mean of heterogeneous, symmetric distributions, achieving near-optimal accuracy in univariate and multivariate settings, with extensions to linear regression.
Contribution
It develops a new estimator combining classical methods and novel empirical process theory, applicable to non-i.i.d. data, with polynomial-time algorithms for multivariate and regression cases.
Findings
Achieves near-optimal estimation in heterogeneous data scenarios.
Provides minimax lower bounds for mean estimation in mixture models.
Extends methodology to multivariate and linear regression settings.
Abstract
We consider the problem of estimating the common mean of independently sampled data, where samples are drawn in a possibly non-identical manner from symmetric, unimodal distributions with a common mean. This generalizes the setting of Gaussian mixture modeling, since the number of distinct mixture components may diverge with the number of observations. We propose an estimator that adapts to the level of heterogeneity in the data, achieving near-optimality in both the i.i.d. setting and some heterogeneous settings, where the fraction of ``low-noise'' points is as small as . Our estimator is a hybrid of the modal interval, shorth, and median estimators from classical statistics; however, the key technical contributions rely on novel empirical process theory results that we derive for independent but non-i.i.d. data. In the multivariate setting, we generalize our theory…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Statistical Methods and Inference · Gaussian Processes and Bayesian Inference
