On Semi-supervised Estimation of Discrete Distributions under f-divergences
Hasan Sabri Melihcan Erol, Lizhong Zheng

TL;DR
This paper investigates the minimax estimation of joint discrete distributions under various f-divergences and loss functions, confirming the optimality of certain estimators for a broad class of divergences and loss metrics.
Contribution
It proves the minimax optimality of univariate estimator combinations for a range of p-values and f-divergences, extending previous results to include p ≤ 2 and multiple divergence measures.
Findings
Establishes minimax optimality for p in [1,2], including l1 loss.
Confirms optimality for divergences like KL, chi-squared, Hellinger, and Le Cam.
Extends prior work to broader divergence and loss function ranges.
Abstract
We study the problem of estimating the joint probability mass function (pmf) over two random variables. In particular, the estimation is based on the observation of samples containing both variables and samples missing one fixed variable. We adopt the minimax framework with loss functions. Recent work established that univariate minimax estimator combinations achieve minimax risk with the optimal first-order constant for in the regime , questions remained for and various -divergences. In our study, we affirm that these composite estimators are indeed minimax optimal for loss functions, specifically for the range , including the critical loss. Additionally, we ascertain their optimality for a suite of -divergences, such as KL, , Squared Hellinger, and Le Cam divergences.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models · Advanced Statistical Process Monitoring · Forecasting Techniques and Applications
