On context-tree prediction of individual sequences
Jacob Ziv, Neri Merhav

TL;DR
This paper investigates the use of context-tree methods for universal prediction of individual sequences, analyzing how the growth rate of contexts affects prediction performance and proposing an optimal algorithm for sublinear growth rates.
Contribution
It introduces a universal context-tree prediction algorithm that performs optimally when the number of contexts grows sublinearly with sequence length, and establishes the linear growth rate as a critical threshold.
Findings
The critical growth rate of contexts is linear in sequence length.
The proposed algorithm achieves near-optimal performance for sublinear growth rates.
Linear growth in contexts prevents universal prediction performance.
Abstract
Motivated by the evident success of context-tree based methods in lossless data compression, we explore, in this paper, methods of the same spirit in universal prediction of individual sequences. By context-tree prediction, we refer to a family of prediction schemes, where at each time instant , after having observed all outcomes of the data sequence , but not yet , the prediction is based on a ``context'' (or a state) that consists of the most recent past outcomes , where the choice of may depend on the contents of a possibly longer, though limited, portion of the observed past, . This is different from the study reported in [1], where general finite-state predictors as well as ``Markov'' (finite-memory) predictors of fixed order, were studied in the regime of individual sequences. Another important…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Machine Learning and Algorithms · semigroups and automata theory
