Ultimate limit on learning non-Markovian behavior: Fisher information   rate and excess information

Paul M. Riechers

arXiv:2310.03968·cs.LG·October 9, 2023·1 cites

Ultimate limit on learning non-Markovian behavior: Fisher information rate and excess information

Paul M. Riechers

PDF

Open Access

TL;DR

This paper derives exact formulas for the fundamental limits of learning parameters in stochastic processes, revealing how Fisher information rate governs the scaling of estimation variance with data length, even for complex non-Markovian systems.

Contribution

It provides closed-form expressions for Fisher information rate in non-Markovian processes and characterizes the convergence modes of information and entropy rates.

Findings

01

Exact Fisher information rate formula for infinite Markov order processes

02

Minimal variance scales as inverse square of observation length

03

Convergence timescales match those of entropy rate relaxation

Abstract

We address the fundamental limits of learning unknown parameters of any stochastic process from time-series data, and discover exact closed-form expressions for how optimal inference scales with observation length. Given a parametrized class of candidate models, the Fisher information of observed sequence probabilities lower-bounds the variance in model estimation from finite data. As sequence-length increases, the minimal variance scales as the square inverse of the length -- with constant coefficient given by the information rate. We discover a simple closed-form expression for this information rate, even in the case of infinite Markov order. We furthermore obtain the exact analytic lower bound on model variance from the observation-induced metadynamic among belief states. We discover ephemeral, exponential, and more general modes of convergence to the asymptotic information rate.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Statistical Mechanics and Entropy · Diffusion and Search Dynamics