Analysis and Practice of Uniquely Decodable One-to-One Code
Chin-Fu Liu, Hsiao-feng Lu, Po-ning Chen

TL;DR
This paper studies uniquely decodable one-to-one codes (UDOOCs), analyzing their combinatorial properties, bounds on average codeword length, and providing efficient algorithms, demonstrating their potential for effective data compression.
Contribution
It introduces a comprehensive analysis of UDOOCs, including enumeration formulas, bounds on average length, and practical encoding/decoding algorithms, highlighting their compression efficiency.
Findings
UDOOCs can achieve compression rates comparable to Huffman codes.
They can yield smaller average codeword lengths than Lempel-Ziv codes.
Theoretical bounds are established for sources with infinite alphabets.
Abstract
In this paper, we consider the so-called uniquely decodable one-to-one code (UDOOC) that is formed by inserting a "comma" indicator, termed the unique word (UW), between consecutive one-to-one codewords for separation. Along this research direction, we first investigate several general combinatorial properties of UDOOCs, in particular the enumeration of the number of UDOOC codewords for any (finite) codeword length. Based on the obtained formula on the number of length-n codewords for a given UW, the per-letter average codeword length of UDOOC for the optimal compression of a given source statistics can be computed. Several upper bounds on the average codeword length of such UDOOCs are next established. The analysis on the bounds of average codeword length then leads to two asymptotic bounds for sources having infinitely many alphabets, one of which is achievable and hence tight for a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · DNA and Biological Computing · Genomic variations and chromosomal abnormalities
