TL;DR
This paper characterizes the guesswork of memoryless string-sources using tilt operations, establishing a connection with exponential families, and extends typical set concepts to analyze large deviations and approximate guesswork distributions.
Contribution
It introduces the tilted family framework for memoryless sources, linking guesswork to exponential families and generalizing typical set concepts for large deviation analysis.
Findings
Tilt operation parametrizes exponential families of string-sources.
Same guesswork for all string lengths implies sources are in the same tilted family.
Provides large deviation bounds and accurate PMF approximations for guesswork.
Abstract
Given a collection of strings, each with an associated probability of occurrence, the guesswork of each of them is their position in a list ordered from most likely to least likely, breaking ties arbitrarily. Guesswork is central to several applications in information theory: Average guesswork provides a lower bound on the expected computational cost of a sequential decoder to decode successfully the transmitted message; the complementary cumulative distribution function of guesswork gives the error probability in list decoding; the logarithm of guesswork is the number of bits needed in optimal lossless one-to-one source coding; and guesswork is the number of trials required of an adversary to breach a password protected system in a brute-force attack. In this paper, we consider memoryless string-sources that generate strings consisting of i.i.d. characters drawn from a finite alphabet,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
