On context-tree prediction of individual sequences

Jacob Ziv; Neri Merhav

arXiv:cs/0508127·cs.IT·July 13, 2007

On context-tree prediction of individual sequences

Jacob Ziv, Neri Merhav

PDF

Open Access

TL;DR

This paper investigates the use of context-tree methods for universal prediction of individual sequences, analyzing how the growth rate of contexts affects prediction performance and proposing an optimal algorithm for sublinear growth rates.

Contribution

It introduces a universal context-tree prediction algorithm that performs optimally when the number of contexts grows sublinearly with sequence length, and establishes the linear growth rate as a critical threshold.

Findings

01

The critical growth rate of contexts is linear in sequence length.

02

The proposed algorithm achieves near-optimal performance for sublinear growth rates.

03

Linear growth in contexts prevents universal prediction performance.

Abstract

Motivated by the evident success of context-tree based methods in lossless data compression, we explore, in this paper, methods of the same spirit in universal prediction of individual sequences. By context-tree prediction, we refer to a family of prediction schemes, where at each time instant $t$ , after having observed all outcomes of the data sequence $x_{1}, ..., x_{t - 1}$ , but not yet $x_{t}$ , the prediction is based on a ``context'' (or a state) that consists of the $k$ most recent past outcomes $x_{t - k}, ..., x_{t - 1}$ , where the choice of $k$ may depend on the contents of a possibly longer, though limited, portion of the observed past, $x_{t - k_{m a x}}, ... x_{t - 1}$ . This is different from the study reported in [1], where general finite-state predictors as well as ``Markov'' (finite-memory) predictors of fixed order, were studied in the regime of individual sequences. Another important…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAlgorithms and Data Compression · Machine Learning and Algorithms · semigroups and automata theory