Probabilistic Arithmetic Automata and their Applications
Tobias Marschall, Inke Herms, Hans-Michael Kaltenbach, Sven Rahmann

TL;DR
This paper introduces probabilistic arithmetic automata (PAAs), a framework for modeling chains of operations whose operands depend on chance, together with algorithms that compute the exact distribution of the results. Applications include pattern matching statistics, peptide fragment analysis, and sequencing read length statistics.
Contribution
It presents PAAs as a unifying framework for diverse probabilistic problems, introduces deterministic automata for constructing PAAs, and demonstrates five applications in computational biology and related areas.
Findings
PAAs can model various probabilistic processes in biology and computer science.
Two algorithms are provided that compute the exact distribution of the results of a PAA.
The framework is flexible and applicable to multiple domains.
Abstract
We present probabilistic arithmetic automata (PAAs), a general model to describe chains of operations whose operands depend on chance, along with two different algorithms to exactly calculate the distribution of the results obtained by such probabilistic calculations. PAAs provide a unifying framework to approach many problems arising in computational biology and elsewhere. Here, we present five different applications, namely (1) pattern matching statistics on random texts, including the computation of the distribution of occurrence counts, waiting time and clump size under HMM background models; (2) exact analysis of window-based pattern matching algorithms; (3) sensitivity of filtration seeds used to detect candidate sequence alignments; (4) length and mass statistics of peptide fragments resulting from enzymatic cleavage reactions; and (5) read length statistics of 454 sequencing…
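To make the idea of "chains of operations whose operands depend on chance" concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the authors' formal definition and not either of the two algorithms from the paper: it assumes a Markov chain over states, a per-state emission distribution, and a binary operation that folds each emission into a running value, and it tracks the joint distribution of (state, value) by straightforward dynamic programming. The function name `paa_value_distribution`, its parameters, and the toy pattern "ab" are all hypothetical choices made for this example.

```python
from collections import defaultdict
from fractions import Fraction


def paa_value_distribution(start, transitions, emissions, operation,
                           start_value, steps):
    """Exact distribution of the running value after `steps` transitions.

    Hypothetical sketch: keeps the joint distribution over (state, value)
    and pushes it forward one step at a time.
    """
    joint = {(start, start_value): Fraction(1)}
    for _ in range(steps):
        nxt = defaultdict(Fraction)
        for (state, value), p in joint.items():
            for new_state, p_move in transitions[state].items():
                # The emission of the state just entered is the random operand.
                for emitted, p_emit in emissions[new_state].items():
                    new_value = operation(value, emitted)
                    nxt[(new_state, new_value)] += p * p_move * p_emit
        joint = nxt
    # Marginalize out the state to get the distribution of the value alone.
    marginal = defaultdict(Fraction)
    for (_, value), p in joint.items():
        marginal[value] += p
    return dict(marginal)


# Toy example (assumed setup): count occurrences of "ab" in a uniform random
# text over {a, b}. States follow the matching automaton for "ab":
# 0 = no progress, 1 = last character was 'a', 2 = just completed "ab".
half = Fraction(1, 2)
transitions = {
    0: {1: half, 0: half},  # read 'a' -> 1, read 'b' -> 0
    1: {1: half, 2: half},  # read 'a' -> 1, read 'b' -> 2
    2: {1: half, 0: half},  # read 'a' -> 1, read 'b' -> 0
}
# State 2 deterministically emits 1 (one match); the other states emit 0.
emissions = {
    0: {0: Fraction(1)},
    1: {0: Fraction(1)},
    2: {1: Fraction(1)},
}

count_dist = paa_value_distribution(
    start=0, transitions=transitions, emissions=emissions,
    operation=lambda value, emitted: value + emitted,
    start_value=0, steps=4,
)
```

For a text of length 4, `count_dist` assigns probability 5/16 to zero occurrences, 5/8 to one occurrence, and 1/16 to two occurrences (only the text "abab"). Using `Fraction` keeps the probabilities exact; floats would work the same way if exactness is not needed.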
Taxonomy
Topics: Algorithms and Data Compression · Genomics and Phylogenetic Studies · Chemical Synthesis and Analysis
