Efficient and Compact Representations of Some Non-Canonical Prefix-Free   Codes

Antonio Fari\~na; Travis Gagie; Szymon Grabowski; Giovanni Manzini,; Gonzalo Navarro; Alberto Ord\'o\~nez

arXiv:1605.06615·cs.DS·April 2, 2021

Efficient and Compact Representations of Some Non-Canonical Prefix-Free Codes

Antonio Fari\~na, Travis Gagie, Szymon Grabowski, Giovanni Manzini,, Gonzalo Navarro, Alberto Ord\'o\~nez

PDF

Open Access

TL;DR

This paper introduces efficient, compact data structures for storing and processing various types of prefix-free codes, enabling fast encoding and decoding with minimal space, applicable in the word-RAM computational model.

Contribution

It presents novel space-efficient representations for optimal and nearly optimal prefix-free codes, including alphabetic and lexicographically ordered variants, with improved encoding and decoding times.

Findings

01

Optimal alphabetic prefix-free code stored in O(σ log L) bits

02

Encoding and decoding in O(min(ℓ, log L)) time

03

Nearly optimal alphabetic code stored in o(σ) bits with constant-time operations

Abstract

For many kinds of prefix-free codes there are efficient and compact alternatives to the traditional tree-based representation. Since these put the codes into canonical form, however, they can only be used when we can choose the order in which codewords are assigned to symbols. In this paper we first show how, given a probability distribution over an alphabet of $σ$ symbols, we can store an optimal alphabetic prefix-free code in $\Oh σ lo g L$ bits such that we can encode and decode any codeword of length $ℓ$ in $\Oh min (ℓ, lo g L)$ time, where $L$ is the maximum codeword length. With $\Oh 2^{L^{ϵ}}$ further bits, for any constant $ϵ > 0$ , we can encode and decode $\Oh lo g ℓ$ time. We then show how to store a nearly optimal alphabetic prefix-free code in $o (\sigma)$ bits such that we can encode and decode in constant time. We also consider a kind of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAlgorithms and Data Compression · DNA and Biological Computing · semigroups and automata theory