# Minimax Estimation of the $L_1$ Distance

**Authors:** Jiantao Jiao, Yanjun Han, Tsachy Weissman

arXiv: 1705.00807 · 2018-06-26

## TL;DR

This paper develops minimax optimal estimators for the $L_1$ distance between two discrete probability measures, achieving near-optimal performance with fewer samples, and reveals the effective sample size enlargement phenomenon.

## Contribution

It introduces new techniques for constructing minimax rate-optimal estimators for $L_1$ distance, extending previous approximation-based methods and analyzing both known and unknown $Q$ scenarios.

## Key findings

- Minimax estimators achieve performance comparable to MLE with fewer samples.
- The uniform distribution case is the hardest for estimation.
- Effective sample size enlargement phenomenon is confirmed in both known and unknown $Q$ cases.

## Abstract

We consider the problem of estimating the $L_1$ distance between two discrete probability measures $P$ and $Q$ from empirical data in a nonasymptotic and large alphabet setting. When $Q$ is known and one obtains $n$ samples from $P$, we show that for every $Q$, the minimax rate-optimal estimator with $n$ samples achieves performance comparable to that of the maximum likelihood estimator (MLE) with $n\ln n$ samples. When both $P$ and $Q$ are unknown, we construct minimax rate-optimal estimators whose worst case performance is essentially that of the known $Q$ case with $Q$ being uniform, implying that $Q$ being uniform is essentially the most difficult case. The \emph{effective sample size enlargement} phenomenon, identified in Jiao \emph{et al.} (2015), holds both in the known $Q$ case for every $Q$ and the $Q$ unknown case. However, the construction of optimal estimators for $\|P-Q\|_1$ requires new techniques and insights beyond the approximation-based method of functional estimation in Jiao \emph{et al.} (2015).

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1705.00807/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/1705.00807/full.md

## References

36 references — full list in the complete paper: https://tomesphere.com/paper/1705.00807/full.md

---
Source: https://tomesphere.com/paper/1705.00807