The Construction of Near-optimal Universal Coding of Integers
Wei Yan, Yunghsiang S. Han

TL;DR
This paper introduces a near-optimal universal coding scheme for integers, called the ta code, which achieves a minimal expansion factor very close to the theoretical lower bound, improving integer compression efficiency.
Contribution
The paper develops a new probability inequality for decreasing distributions and constructs the ta code, achieving a near-optimal expansion factor of 2.0386, narrowing the bounds for optimal integer coding.
Findings
The ta code achieves an expansion factor of 2.0386.
The minimum expansion factor for optimal UCI is bounded between 2 and 2.0386.
A new probability inequality for decreasing distributions is established.
Abstract
The Universal Coding of Integers~(UCI) is suitable for discrete memoryless sources with unknown probability distributions and infinitely countable alphabet sizes. A UCI is a class of prefix codes for which the ratio of the average codeword length to is within a constant expansion factor \textcolor{red}{} for any decreasing probability distribution , where is the entropy of . For any UCI code , \emph{the minimum expansion factor} \textcolor{red}{} is defined to represent the infimum of the set of extension factors of . Each has a unique corresponding \textcolor{red}{}, and the smaller \textcolor{red}{} is, the better the compression performance of is. The class of UCIs (or a family…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
