How to Shrink Confidence Sets for Many Equivalent Discrete   Distributions?

Odalric-Ambrym Maillard; Mohammad Sadegh Talebi

arXiv:2407.15662·stat.ML·July 23, 2024

How to Shrink Confidence Sets for Many Equivalent Discrete Distributions?

Odalric-Ambrym Maillard, Mohammad Sadegh Talebi

PDF

Open Access

TL;DR

This paper develops a method to refine confidence sets for multiple unknown discrete distributions that are permutation-equivalent, leveraging their structural relationship to improve confidence bounds with finite-sample guarantees.

Contribution

It introduces a strategy to exploit permutation-equivalence among distributions, providing finite-time bounds and demonstrating asymptotic shrinkage of confidence sets.

Findings

01

Refined confidence sets improve with enough observations.

02

Confidence set sizes shrink at rates of O(1/√sum n_k) and O(1/max n_k).

03

Method benefits reinforcement learning tasks.

Abstract

We consider the situation when a learner faces a set of unknown discrete distributions $(p_{k})_{k \in K}$ defined over a common alphabet $X$ , and can build for each distribution $p_{k}$ an individual high-probability confidence set thanks to $n_{k}$ observations sampled from $p_{k}$ . The set $(p_{k})_{k \in K}$ is structured: each distribution $p_{k}$ is obtained from the same common, but unknown, distribution q via applying an unknown permutation to $X$ . We call this \emph{permutation-equivalence}. The goal is to build refined confidence sets \emph{exploiting} this structural property. Like other popular notions of structure (Lipschitz smoothness, Linearity, etc.) permutation-equivalence naturally appears in machine learning problems, and to benefit from its potential gain calls for a specific approach. We present a strategy to effectively exploit…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Bayesian Methods and Mixture Models · Advanced Statistical Process Monitoring

MethodsSparse Evolutionary Training