PAC Mode Estimation using PPR Martingale Confidence Sequences
Shubham Anand Jain, Rohan Shah, Sanit Gupta, Denil Mehta, Inderjeet, Jayakumar Nair, Jian Vora, Sushil Khyalia, Sourav Das, Vinay J. Ribeiro,, Shivaram Kalyanakrishnan

TL;DR
This paper introduces a new, efficient method for identifying the most common value in a discrete distribution using confidence sequences, improving sample efficiency in practical sampling tasks like election forecasting.
Contribution
It generalizes PPR martingale confidence sequences from binary to multi-class mode estimation, providing an asymptotically optimal, parameter-free, and computationally efficient stopping rule.
Findings
PPR-1v1 outperforms competitors in sample efficiency.
The method is asymptotically optimal as error probability approaches zero.
Effective in practical applications like election forecasting and blockchain verification.
Abstract
We consider the problem of correctly identifying the \textit{mode} of a discrete distribution with sufficiently high probability by observing a sequence of i.i.d. samples drawn from . This problem reduces to the estimation of a single parameter when has a support set of size . After noting that this special case is tackled very well by prior-posterior-ratio (PPR) martingale confidence sequences \citep{waudby-ramdas-ppr}, we propose a generalisation to mode estimation, in which may take values. To begin, we show that the "one-versus-one" principle to generalise from to classes is more efficient than the "one-versus-rest" alternative. We then prove that our resulting stopping rule, denoted PPR-1v1, is asymptotically optimal (as the mistake probability is taken to ). PPR-1v1 is parameter-free and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAuction Theory and Applications · Probability and Risk Models · Consumer Market Behavior and Pricing
