Performance of the empirical median for location estimation in heteroscedastic settings
Sirine Louati

TL;DR
This paper analyzes the empirical median's effectiveness for estimating a common location parameter in heteroscedastic data, providing bounds that clarify its robustness and limitations in practical, variable-quality data scenarios.
Contribution
It offers the first non-asymptotic bounds on the empirical median's error in heteroscedastic settings, enhancing understanding of its performance and limitations.
Findings
Derived matching upper and lower bounds on estimation error
Characterized robustness and limitations of the empirical median
Provided insights into median's performance with variable data quality
Abstract
We investigate the performance of the empirical median for location estimation in heteroscedastic settings. Specifically, we consider independent symmetric real-valued random variables that share a common but unknown location parameter while having different and unknown scale parameters. Estimation under heteroscedasticity arises naturally in many practical situations and has recently attracted considerable attention. In this work, we analyze the empirical median as an estimator of the common location parameter and derive matching non-asymptotic upper and lower bounds on its estimation error. These results fully characterize the behavior of the empirical median in heteroscedastic settings, clarifying both its robustness and its intrinsic limitations and offering a precise understanding of its performance in modern settings where data quality may vary across sources.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProbability and Risk Models · Analysis of environmental and stochastic processes · Statistical Methods and Inference
