A memory versus compression ratio trade-off in PPM via compressed   context modeling

M. Oguzhan Kulekci

arXiv:1211.2636·cs.DS·November 13, 2012

A memory versus compression ratio trade-off in PPM via compressed context modeling

M. Oguzhan Kulekci

PDF

Open Access

TL;DR

This paper explores reducing memory usage in PPM compression by using compressed context modeling, achieving significant memory savings with a modest decrease in compression efficiency.

Contribution

It introduces a novel approach to PPM that employs compressed contexts, balancing memory consumption and compression ratio effectively.

Findings

01

Approximately 20-25% memory reduction achieved.

02

Up to 7% decrease in compression ratio.

03

Effective in low-order models.

Abstract

Since its introduction prediction by partial matching (PPM) has always been a de facto gold standard in lossless text compression, where many variants improving the compression ratio and speed have been proposed. However, reducing the high space requirement of PPM schemes did not gain that much attention. This study focuses on reducing the memory consumption of PPM via the recently proposed compressed context modeling that uses the compressed representations of contexts in the statistical model. Differently from the classical context definition as the string of the preceding characters at a particular position, CCM considers context as the amount of preceding information that is actually the bit stream composed by compressing the previous symbols. We observe that by using the CCM, the data structures, particularly the context trees, can be implemented in smaller space, and present a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAlgorithms and Data Compression · Network Packet Processing and Optimization · DNA and Biological Computing