Non-erasing Chomsky-Sch{\"u}tzenberger theorem with grammar-independent alphabet
Stefano Crespi Reghizzi, Pierluigi San Pietro

TL;DR
This paper presents a new version of the Chomsky-Schützenberger theorem that is non-erasing and grammar-independent, with a polynomially bounded Dyck alphabet size and strictly locally testable regular language.
Contribution
It introduces a non-erasing, grammar-independent CST variant with polynomial Dyck alphabet size and strictly locally testable regular language, improving previous results.
Findings
Dyck alphabet size is polynomial in the alphabet size, specifically O(|Σ|^{46})
For linear grammars in Double Greibach Normal Form, the polynomial degree reduces to 2
The regular language R is strictly locally testable
Abstract
The famous theorem by Chomsky and Sch\"utzenberger (CST) says that every context-free language over an alphabet is representable as , where is a Dyck language over a set of brackets, is a local language and is an alphabetic homomorphism that erases unboundedly many symbols. Berstel found that the number of erasures can be linearly limited if the grammar is in Greibach normal form; Berstel and Boasson (and later, independently, Okhotin) proved a non-erasing variant of CST for grammars in Double Greibach Normal Form. In all these CST statements, however, the size of the Dyck alphabet depends on the grammar size for . In the Stanley variant of the CST, only depends on and not on the grammar, but the homomorphism erases many more symbols than in the other versions of CST; also, the regular language is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
