# A theory of incremental compression

**Authors:** Arthur Franz, Oleksandr Antonenko, Roman Soletskyi

arXiv: 1908.03781 · 2020-09-15

## TL;DR

This paper introduces a theory of incremental data compression based on feature detection, resulting in near-optimal partitioning of information, and presents a practical algorithm called ALICE with formal complexity analysis.

## Contribution

It proposes a novel incremental compression framework using features, links it to Kolmogorov complexity, and introduces the ALICE algorithm with complexity bounds.

## Key findings

- The partitioning approach approximates Kolmogorov complexity.
- ALICE algorithm is computable with analyzed time complexity.
- Features relate to Martin-Löf randomness tests, formalizing properties of objects.

## Abstract

The ability to find short representations, i.e. to compress data, is crucial for many intelligent systems. We present a theory of incremental compression showing that arbitrary data strings, that can be described by a set of features, can be compressed by searching for those features incrementally, which results in a partition of the information content of the string into a complete set of pairwise independent pieces. The description length of this partition turns out to be close to optimal in terms of the Kolmogorov complexity of the string. Exploiting this decomposition, we introduce ALICE - a computable ALgorithm for Incremental ComprEssion - and derive an expression for its time complexity. Finally, we show that our concept of a feature is closely related to Martin-L\"of randomness tests, thereby formalizing the meaning of "property" for computable objects.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1908.03781/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/1908.03781/full.md

## References

30 references — full list in the complete paper: https://tomesphere.com/paper/1908.03781/full.md

---
Source: https://tomesphere.com/paper/1908.03781