Detecting Renewal States in Chains of Variable Length via Intrinsic   Bayes Factors

Victor Freguglia; Nancy Garcia

arXiv:2110.07430·cs.LG·January 10, 2022

Detecting Renewal States in Chains of Variable Length via Intrinsic Bayes Factors

Victor Freguglia, Nancy Garcia

PDF

Open Access

TL;DR

This paper introduces a Bayesian method using Intrinsic Bayes Factors to detect renewal states in variable-length Markov chains, enabling the segmentation of sequences into independent blocks.

Contribution

It proposes a novel Bayesian approach with Monte Carlo methods for identifying renewal states in variable-length Markov chains, improving sequence segmentation accuracy.

Findings

01

Effective detection of renewal states demonstrated on artificial datasets.

02

Method successfully applied to linguistic data.

03

Bayesian approach outperforms traditional methods.

Abstract

Markov chains with variable length are useful parsimonious stochastic models able to generate most stationary sequence of discrete symbols. The idea is to identify the suffixes of the past, called contexts, that are relevant to predict the future symbol. Sometimes a single state is a context, and looking at the past and finding this specific state makes the further past irrelevant. States with such property are called renewal states and they can be used to split the chain into independent and identically distributed blocks. In order to identify renewal states for chains with variable length, we propose the use of Intrinsic Bayes Factor to evaluate the hypothesis that some particular state is a renewal state. In this case, the difficulty lies in integrating the marginal posterior distribution for the random context trees for general prior distribution on the space of context trees, with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Speech Recognition and Synthesis · Topic Modeling