The landscape of compressibility measures for two-dimensional data
Lorenzo Carfagna, Giovanni Manzini

TL;DR
This paper extends one-dimensional compressibility measures to two-dimensional data, analyzing their properties, relationships, and applications, including space-efficient data structures and algorithms for matrices.
Contribution
It introduces two-dimensional versions of gamma and delta measures, studies their properties, and develops a linear-time algorithm for constructing two-dimensional block trees.
Findings
delta_2D is monotone and computable in linear time
The gap between delta_2D and gamma_2D can be Ω(√n)
Space usage of 2D block trees can be bounded by gamma_2D and delta_2D
Abstract
In this paper we extend to two-dimensional data two recently introduced one-dimensional compressibility measures: the measure defined in terms of the smallest string attractor, and the measure defined in terms of the number of distinct substrings of the input string. Concretely, we introduce the two-dimensional measures and , as natural generalizations of and , and we initiate the study of their properties. Among other things, we prove that is monotone and can be computed in linear time, and we show that, although it is still true that , the gap between the two measures can be and therefore asymptotically larger than the gap between and . To complete the scenario of two-dimensional compressibility measures, we introduce the measure which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic processes and statistical mechanics · Mathematical Dynamics and Fractals · Markov Chains and Monte Carlo Methods
