Subsequence Matching and Analysis Problems for Formal Languages
Szil\'ard Zsolt Fazekas, Tore Ko{\ss}, Florin Manea, Robert, Merca\c{s}, Timo Specht

TL;DR
This paper investigates the computational complexity of subsequence problems in formal languages, providing decidability results and algorithms for regular, context-free, and certain intermediate language classes.
Contribution
It extends previous work by establishing decidability and algorithms for subsequence problems in context-free languages and exploring intermediate language classes.
Findings
Problems are decidable for context-free languages.
Efficient algorithms are provided for certain language classes.
Problems are undecidable for context-sensitive languages.
Abstract
In this paper, we study a series of algorithmic problems related to the subsequences occurring in the strings of a given language, under the assumption that this language is succinctly represented by a grammar generating it, or an automaton accepting it. In particular, we focus on the following problems: Given a string and a language , does there exist a word of which has as subsequence? Do all words of have as a subsequence? Given an integer alongside , does there exist a word of which has all strings of length , over the alphabet of , as subsequences? Do all words of have all strings of length as subsequences? For the last two problems, efficient algorithms were already presented in [Adamson et al., ISAAC 2023] for the case when is a regular language, and efficient solutions can be easily obtained for the first two problems. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · semigroups and automata theory · Natural Language Processing Techniques
