On Precision - Redundancy Relation in the Design of Source Coding Algorithms
Yuriy Reznik

TL;DR
This paper explores how finite-precision probability representations affect source coding efficiency, establishing a quantitative relation between redundancy and memory bits needed, with implications for practical coding algorithm design.
Contribution
It introduces a simple relation between redundancy and probability representation bits, revealing bounds on efficiency for various coding schemes and alphabet sizes.
Findings
Redundancy W is related to bits W by W ≈ η log2(m/R).
Existence of codes with η = 1/2 for binary alphabets.
Existence of codes with η = m/(m+1) for m-ary alphabets.
Abstract
We study the effects of finite-precision representation of source's probabilities on the efficiency of classic source coding algorithms, such as Shannon, Gilbert-Moore, or arithmetic codes. In particular, we establish the following simple connection between the redundancy and the number of bits necessary for representation of source's probabilities in computer's memory ( is assumed to be small): \begin{equation*} W \lesssim \eta \log_2 \frac{m}{R}, \end{equation*} where is the cardinality of the source's alphabet, and is an implementation-specific constant. In case of binary alphabets () we show that there exist codes for which , and in -ary case () we show that there exist codes for which . In general case, however (which includes designs relying on progressive updates of frequency counters), we show that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Computability, Logic, AI Algorithms · Machine Learning and Algorithms
