Using Pseudo-Stochastic Rational Languages in Probabilistic Grammatical Inference
Amaury Habrard (LIF), Francois Denis (LIF), Yann Esposito (LIF)

TL;DR
This paper introduces pseudo-stochastic rational languages generated by multiplicity automata, providing a theoretical framework and demonstrating that the DEES algorithm outperforms classical inference methods in probabilistic grammatical inference.
Contribution
It offers a new class of rational languages, pseudo-stochastic rational languages, and proves their properties, while also empirically showing DEES's superior performance over existing algorithms.
Findings
Pseudo-stochastic rational languages are decidable in polynomial time.
DEES outperforms classical algorithms like ALERGIA and MDI in most experiments.
The class of pseudo-stochastic languages broadens the scope of probabilistic grammatical inference.
Abstract
In probabilistic grammatical inference, a usual goal is to infer a good approximation of an unknown distribution P called a stochastic language. The estimate of P stands in some class of probabilistic models such as probabilistic automata (PA). In this paper, we focus on probabilistic models based on multiplicity automata (MA). The stochastic languages generated by MA are called rational stochastic languages; they strictly include stochastic languages generated by PA; they also admit a very concise canonical representation. Despite the fact that this class is not recursively enumerable, it is efficiently identifiable in the limit by using the algorithm DEES, introduced by the authors in a previous paper. However, the identification is not proper and before the convergence of the algorithm, DEES can produce MA that do not define stochastic languages. Nevertheless, it is possible to use…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Machine Learning and Algorithms · Formal Methods in Verification
