How to Shrink Confidence Sets for Many Equivalent Discrete Distributions?
Odalric-Ambrym Maillard, Mohammad Sadegh Talebi

TL;DR
This paper develops a method to refine confidence sets for multiple unknown discrete distributions that are permutation-equivalent, leveraging their structural relationship to improve confidence bounds with finite-sample guarantees.
Contribution
It introduces a strategy to exploit permutation-equivalence among distributions, providing finite-time bounds and demonstrating asymptotic shrinkage of confidence sets.
Findings
Refined confidence sets improve with enough observations.
Confidence set sizes shrink at rates of O(1/√sum n_k) and O(1/max n_k).
Method benefits reinforcement learning tasks.
Abstract
We consider the situation when a learner faces a set of unknown discrete distributions defined over a common alphabet , and can build for each distribution an individual high-probability confidence set thanks to observations sampled from . The set is structured: each distribution is obtained from the same common, but unknown, distribution q via applying an unknown permutation to . We call this \emph{permutation-equivalence}. The goal is to build refined confidence sets \emph{exploiting} this structural property. Like other popular notions of structure (Lipschitz smoothness, Linearity, etc.) permutation-equivalence naturally appears in machine learning problems, and to benefit from its potential gain calls for a specific approach. We present a strategy to effectively exploit…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Bayesian Methods and Mixture Models · Advanced Statistical Process Monitoring
MethodsSparse Evolutionary Training
