Optimal Confidence Regions for the Multinomial Parameter

Matthew L. Malloy; Ardhendu Tripathy; Robert D. Nowak

arXiv:2002.01044·stat.ML·February 1, 2021·6 cites

Open Access

TL;DR

This paper introduces a new theoretical framework for constructing minimum average volume confidence regions for multinomial parameters, leading to optimal confidence intervals and improved sample efficiency in machine learning.

Contribution

It develops the first theory for minimum average volume confidence regions for categorical data, answering a longstanding open problem.

Findings

01. Constructed confidence regions with minimal average volume.

02. Proved that the optimality of these regions carries over to confidence intervals for linear functionals.

03. Demonstrated improvements in sample complexity and regret in machine learning.

Abstract

Construction of tight confidence regions and intervals is central to statistical inference and decision making. This paper develops new theory for minimum average volume confidence regions for categorical data. More precisely, consider an empirical distribution $\widehat{p}$ generated from $n$ iid realizations of a random variable that takes one of $k$ possible values according to an unknown distribution $p$. This is analogous to a single draw from a multinomial distribution. A confidence region is a subset of the probability simplex that depends on $\widehat{p}$ and contains the unknown $p$ with a specified confidence. This paper shows how one can construct minimum average volume confidence regions, answering a longstanding question. We also show the optimality of the regions directly translates to optimal confidence intervals of…
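To make the setup in the abstract concrete, here is a minimal Python sketch. It draws $n$ iid samples from $k$ categories, forms the empirical distribution, and estimates by simulation how often a region built around the empirical distribution contains the unknown distribution. The L1-ball region and the names `empirical_distribution`, `in_l1_ball`, and `estimate_coverage` are assumptions chosen for illustration; this is not the minimum average volume construction developed in the paper.

```python
import random
from collections import Counter


def empirical_distribution(samples, k):
    """Return the empirical distribution over categories 0..k-1."""
    counts = Counter(samples)
    n = len(samples)
    return [counts.get(i, 0) / n for i in range(k)]


def in_l1_ball(p_hat, p, radius):
    """Membership test for a hypothetical L1-ball region centered at p_hat.

    This simple region is an illustrative stand-in, not the paper's region.
    """
    return sum(abs(a - b) for a, b in zip(p_hat, p)) <= radius


def estimate_coverage(p, n, radius, trials=2000, seed=0):
    """Monte Carlo estimate of how often the region contains the true p."""
    rng = random.Random(seed)
    k = len(p)
    hits = 0
    for _ in range(trials):
        # n iid realizations of a k-valued random variable with distribution p
        samples = rng.choices(range(k), weights=p, k=n)
        if in_l1_ball(empirical_distribution(samples, k), p, radius):
            hits += 1
    return hits / trials
```

Calling `estimate_coverage([0.5, 0.3, 0.2], n=50, radius=0.3)` returns the fraction of simulated trials in which the region covered the true distribution. A region is valid at confidence level $1-\alpha$ when this coverage is at least $1-\alpha$ for every possible $p$; among valid regions, a smaller region is more informative, which is the sense in which volume matters.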

Taxonomy

Topics: Advanced Bandit Algorithms Research · Reinforcement Learning in Robotics · Advanced Multi-Objective Optimization Algorithms