Using Pseudo-Stochastic Rational Languages in Probabilistic Grammatical   Inference

Amaury Habrard (LIF); Francois Denis (LIF); Yann Esposito (LIF)

arXiv:cs/0607085·cs.LG·December 19, 2008

Using Pseudo-Stochastic Rational Languages in Probabilistic Grammatical Inference

Amaury Habrard (LIF), Francois Denis (LIF), Yann Esposito (LIF)

PDF

Open Access

TL;DR

This paper introduces pseudo-stochastic rational languages generated by multiplicity automata, providing a theoretical framework and demonstrating that the DEES algorithm outperforms classical inference methods in probabilistic grammatical inference.

Contribution

It offers a new class of rational languages, pseudo-stochastic rational languages, and proves their properties, while also empirically showing DEES's superior performance over existing algorithms.

Findings

01

Pseudo-stochastic rational languages are decidable in polynomial time.

02

DEES outperforms classical algorithms like ALERGIA and MDI in most experiments.

03

The class of pseudo-stochastic languages broadens the scope of probabilistic grammatical inference.

Abstract

In probabilistic grammatical inference, a usual goal is to infer a good approximation of an unknown distribution P called a stochastic language. The estimate of P stands in some class of probabilistic models such as probabilistic automata (PA). In this paper, we focus on probabilistic models based on multiplicity automata (MA). The stochastic languages generated by MA are called rational stochastic languages; they strictly include stochastic languages generated by PA; they also admit a very concise canonical representation. Despite the fact that this class is not recursively enumerable, it is efficiently identifiable in the limit by using the algorithm DEES, introduced by the authors in a previous paper. However, the identification is not proper and before the convergence of the algorithm, DEES can produce MA that do not define stochastic languages. Nevertheless, it is possible to use…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference · Machine Learning and Algorithms · Formal Methods in Verification