Prediction from compression for models with infinite memory, with   applications to hidden Markov and renewal processes

Yanjun Han; Tianze Jiang; Yihong Wu

arXiv:2404.15454·math.ST·April 25, 2024

Prediction from compression for models with infinite memory, with applications to hidden Markov and renewal processes

Yanjun Han, Tianze Jiang, Yihong Wu

PDF

Open Access

TL;DR

This paper develops optimal prediction methods for models with long-term memory, such as hidden Markov and renewal processes, achieving near-optimal risk bounds by extending universal compression ideas.

Contribution

It introduces a new estimator with risk bounds tied to redundancy and long-range dependencies, improving prediction accuracy for models with infinite memory.

Findings

01

Optimal prediction risk for HMMs and renewal processes determined.

02

Polynomial-time estimator for HMMs achieves b8(log n/n) risk.

03

Matching minimax lower bounds established via redundancy and mutual information.

Abstract

Consider the problem of predicting the next symbol given a sample path of length n, whose joint distribution belongs to a distribution class that may have long-term memory. The goal is to compete with the conditional predictor that knows the true model. For both hidden Markov models (HMMs) and renewal processes, we determine the optimal prediction risk in Kullback- Leibler divergence up to universal constant factors. Extending existing results in finite-order Markov models [HJW23] and drawing ideas from universal compression, the proposed estimator has a prediction risk bounded by redundancy of the distribution class and a memory term that accounts for the long-range dependency of the model. Notably, for HMMs with bounded state and observation spaces, a polynomial-time estimator based on dynamic programming is shown to achieve the optimal prediction risk {\Theta}(log n/n); prior to this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Methods and Mixture Models · Neural Networks and Applications · Machine Learning and Algorithms