Optimal Confidence Regions for the Multinomial Parameter
Matthew L. Malloy, Ardhendu Tripathy, Robert D. Nowak

TL;DR
This paper introduces a new theoretical framework for constructing minimum average volume confidence regions for multinomial parameters, leading to optimal confidence intervals and improved sample efficiency in machine learning.
Contribution
It develops the first theory for minimum average volume confidence regions for categorical data, answering a longstanding open problem.
Findings
Constructed confidence regions with minimal average volume.
Proved the optimality of these regions for linear functionals.
Demonstrated improvements in sample complexity and regret in machine learning.
Abstract
Construction of tight confidence regions and intervals is central to statistical inference and decision making. This paper develops new theory showing minimum average volume confidence regions for categorical data. More precisely, consider an empirical distribution generated from iid realizations of a random variable that takes one of possible values according to an unknown distribution . This is analogous to a single draw from a multinomial distribution. A confidence region is a subset of the probability simplex that depends on and contains the unknown with a specified confidence. This paper shows how one can construct minimum average volume confidence regions, answering a long standing question. We also show the optimality of the regions directly translates to optimal confidence intervals of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Reinforcement Learning in Robotics · Advanced Multi-Objective Optimization Algorithms
