Estimating location parameters in entangled single-sample distributions

Ankit Pensia; Varun Jog; Po-Ling Loh

arXiv:1907.03087·math.ST·July 9, 2019·1 cites

Estimating location parameters in entangled single-sample distributions

Ankit Pensia, Varun Jog, Po-Ling Loh

PDF

Open Access

TL;DR

This paper introduces a novel adaptive estimator for the common mean of heterogeneous, symmetric distributions, achieving near-optimal accuracy in univariate and multivariate settings, with extensions to linear regression.

Contribution

It develops a new estimator combining classical methods and novel empirical process theory, applicable to non-i.i.d. data, with polynomial-time algorithms for multivariate and regression cases.

Findings

01

Achieves near-optimal estimation in heterogeneous data scenarios.

02

Provides minimax lower bounds for mean estimation in mixture models.

03

Extends methodology to multivariate and linear regression settings.

Abstract

We consider the problem of estimating the common mean of independently sampled data, where samples are drawn in a possibly non-identical manner from symmetric, unimodal distributions with a common mean. This generalizes the setting of Gaussian mixture modeling, since the number of distinct mixture components may diverge with the number of observations. We propose an estimator that adapts to the level of heterogeneity in the data, achieving near-optimality in both the i.i.d. setting and some heterogeneous settings, where the fraction of ``low-noise'' points is as small as $\frac{l o g n}{n}$ . Our estimator is a hybrid of the modal interval, shorth, and median estimators from classical statistics; however, the key technical contributions rely on novel empirical process theory results that we derive for independent but non-i.i.d. data. In the multivariate setting, we generalize our theory…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Methods and Mixture Models · Statistical Methods and Inference · Gaussian Processes and Bayesian Inference