Subsequence Matching and Analysis Problems for Formal Languages

Szil\'ard Zsolt Fazekas; Tore Ko{\ss}; Florin Manea; Robert; Merca\c{s}; Timo Specht

arXiv:2410.07992·cs.FL·October 11, 2024

Subsequence Matching and Analysis Problems for Formal Languages

Szil\'ard Zsolt Fazekas, Tore Ko{\ss}, Florin Manea, Robert, Merca\c{s}, Timo Specht

PDF

Open Access

TL;DR

This paper investigates the computational complexity of subsequence problems in formal languages, providing decidability results and algorithms for regular, context-free, and certain intermediate language classes.

Contribution

It extends previous work by establishing decidability and algorithms for subsequence problems in context-free languages and exploring intermediate language classes.

Findings

01

Problems are decidable for context-free languages.

02

Efficient algorithms are provided for certain language classes.

03

Problems are undecidable for context-sensitive languages.

Abstract

In this paper, we study a series of algorithmic problems related to the subsequences occurring in the strings of a given language, under the assumption that this language is succinctly represented by a grammar generating it, or an automaton accepting it. In particular, we focus on the following problems: Given a string $w$ and a language $L$ , does there exist a word of $L$ which has $w$ as subsequence? Do all words of $L$ have $w$ as a subsequence? Given an integer $k$ alongside $L$ , does there exist a word of $L$ which has all strings of length $k$ , over the alphabet of $L$ , as subsequences? Do all words of $L$ have all strings of length $k$ as subsequences? For the last two problems, efficient algorithms were already presented in [Adamson et al., ISAAC 2023] for the case when $L$ is a regular language, and efficient solutions can be easily obtained for the first two problems. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAlgorithms and Data Compression · semigroups and automata theory · Natural Language Processing Techniques