A Note on Probabilistic Models over Strings: the Linear Algebra Approach
Alexandre Bouchard-C\^ot\'e

TL;DR
This paper presents a linear algebra-based proof for the complexity of inference in probabilistic string models, offering a new perspective that simplifies extensions and impacts algorithm implementation.
Contribution
It introduces a novel linear algebra approach to analyze the complexity of inference in string-valued probabilistic models, specifically revisiting TKF91.
Findings
New linear algebra proof of inference complexity for TKF91
Simplified extension to other models
Implications for inference algorithm efficiency
Abstract
Probabilistic models over strings have played a key role in developing methods allowing indels to be treated as phylogenetically informative events. There is an extensive literature on using automata and transducers on phylogenies to do inference on these probabilistic models, in which an important theoretical question in the field is the complexity of computing the normalization of a class of string-valued graphical models. This question has been investigated using tools from combinatorics, dynamic programming, and graph theory, and has practical applications in Bayesian phylogenetics. In this work, we revisit this theoretical question from a different point of view, based on linear algebra. The main contribution is a new proof of a known result on the complexity of inference on TKF91, a well-known probabilistic model over strings. Our proof uses a different approach based on classical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
