Formal Languages via Theories over Strings
Joel D. Day, Vijay Ganesh, Nathan Grewal, Florin Manea

TL;DR
This paper explores the expressive power and decidability of formal languages defined by theories over strings, including word equations and length constraints, across various language classes and theories.
Contribution
It provides a comprehensive analysis of the expressive capabilities and decidability issues of formal languages over 20 theories involving string and arithmetic constraints.
Findings
Hierarchies based on presence of word equations
Decidability results for emptiness and universality
Undecidability of language expressibility between theories
Abstract
We investigate the properties of formal languages expressible in terms of formulas over quantifier-free theories of word equations, arithmetic over length constraints, and language membership predicates for the classes of regular, visibly pushdown, and deterministic context-free languages. In total, we consider 20 distinct theories and decidability questions for problems such as emptiness and universality for formal languages over them. First, we discuss their relative expressive power and observe a rough division into two hierarchies based on whether or not word equations are present. Second, we consider the decidability status of several important decision problems, such as emptiness and universality. Note that the emptiness problem is equivalent to the satisfiability problem over the corresponding theory. Third, we consider the problem of whether a language in one theory is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · semigroups and automata theory · Web Application Security Vulnerabilities
