Estimating the Entropy Rate of Finite Markov Chains with Application to Behavior Studies
Brian Vegetabile, Jenny Molet, Tallie Z. Baram, and Hal Stern

TL;DR
This paper compares three estimators for the entropy rate of finite Markov chains, including two that require known order and one based on Lempel-Ziv compression, to analyze behavioral predictability.
Contribution
It provides a comprehensive comparison of three entropy rate estimators for Markov processes, including a bootstrap method for standard errors, highlighting their advantages and limitations.
Findings
Two estimators are consistent for short sequences but require known order.
The Lempel-Ziv based method is order-agnostic but biased for short sequences.
Combined use of methods offers a clearer understanding of behavioral entropy.
Abstract
Predictability of behavior has emerged an an important characteristic in many fields including biology, medicine, and marketing. Behavior can be recorded as a sequence of actions performed by an individual over a given time period. This sequence of actions can often be modeled as a stationary time-homogeneous Markov chain and the predictability of the individual's behavior can be quantified by the entropy rate of the process. This paper provides a comprehensive investigation of three estimators of the entropy rate of finite Markov processes and a bootstrap procedure for providing standard errors. The first two methods directly estimate the entropy rate through estimates of the transition matrix and stationary distribution of the process; the methods differ in the technique used to estimate the stationary distribution. The third method is related to the sliding-window Lempel-Ziv (SWLZ)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
