A Markov-Chain Characterization of Finite-State Dimension and a Generalization of Agafonov's Theorem
Laurent Bienvenu, Hugo Gimbert, Subin Pulari

TL;DR
This paper provides a new information-theoretic characterization of finite-state dimension using Markov chain distributions and extends Agafonov's theorem to arbitrary sequences, linking their finite-state dimensions.
Contribution
It generalizes the Schnorr-Stimm theorem by characterizing finite-state dimension via Kullback-Leibler divergence and extends Agafonov's theorem to all sequences.
Findings
Finite-state dimension characterized by divergence between Markov chain distributions.
Extended Agafonov's theorem to arbitrary sequences with quantitative relationships.
Provided a new perspective on normality and automata-based sequence analysis.
Abstract
Finite-state dimension quantifies the asymptotic rate of information in an infinite sequence as perceived by finite automata. For a fixed alphabet, the infinite sequences that have maximal finite-state dimension are exactly those that are Borel normal, i.e., in which all words of any given length appear with the same frequency. A theorem of Schnorr and Stimm (1972) shows that a real number is Borel normal if and only if, for every finite-state irreducible Markov chain with fair transitions, when the chain is simulated using the binary expansion of the given number, the empirical distribution of states converges to its stationary distribution. In this paper we extend this correspondence beyond normal numbers. We show that the finite-state dimension of a sequence can be characterized in terms of the conditional Kullback-Leibler divergence between the limiting distributions arising from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topicssemigroups and automata theory · Algorithms and Data Compression · Computability, Logic, AI Algorithms
