Prediction of Large Alphabet Processes and Its Application to Adaptive   Source Coding

Boris Ryabko; Jaakko Astola

arXiv:cs/0504079·cs.IT·September 29, 2009

Prediction of Large Alphabet Processes and Its Application to Adaptive Source Coding

Boris Ryabko, Jaakko Astola

PDF

Open Access

TL;DR

This paper addresses predicting sequences from large alphabets, proposing a method that improves prediction precision over existing algorithms, with applications to adaptive source coding.

Contribution

It introduces a novel prediction method for large alphabet sources that outperforms known algorithms in terms of Kullback-Leibler divergence.

Findings

01

Enhanced prediction accuracy for large alphabet sequences

02

Method applicable to adaptive source coding

03

Improved divergence bounds compared to prior algorithms

Abstract

The problem of predicting a sequence $x_{1}, x_{2}, ...$ generated by a discrete source with unknown statistics is considered. Each letter $x_{t + 1}$ is predicted using information on the word $x_{1} x_{2} ... x_{t}$ only. In fact, this problem is a classical problem which has received much attention. Its history can be traced back to Laplace. We address the problem where each $x_{i}$ belongs to some large (or even infinite) alphabet. A method is presented for which the precision is greater than for known algorithms, where precision is estimated by the Kullback-Leibler divergence. The results can readily be translated to results about adaptive coding.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAlgorithms and Data Compression · Advanced Data Compression Techniques · Error Correcting Code Techniques