Subsequence Matching and LCS with Segment Number Constraints
Yuki Yonemoto, Takuya Mieno, Shunsuke Inenaga, Ryo Yoshinaka, and, Ayumi Shinohara

TL;DR
This paper explores the segmental longest common subsequence (SegLCS) and pattern matching problems, providing new algorithms and complexity bounds for these problems with segment number constraints.
Contribution
It introduces a near-quadratic time algorithm for segmental subsequence pattern matching and establishes a conditional lower bound, advancing understanding of these problems.
Findings
O(mn) time algorithm for segmental subsequence pattern matching
Conditional lower bound of O((mn)^{1-ε}) under SETH
Efficient algorithm for SegLCS with time complexity depending on solution length
Abstract
The longest common subsequence (LCS) is a fundamental problem in string processing which has numerous algorithmic studies, extensions, and applications. A sequence of strings s said to be an (-)segmentation of a string if . Li et al. [BIBM 2022] proposed a new variant of the LCS problem for given strings and an integer , which we hereby call the segmental LCS problem (SegLCS), of finding (the length of) a longest string that has an -segmentation which can be embedded into both and . Li et al. [IJTCS-FAW 2024] gave a dynamic programming solution that solves SegLCS in time with space, where , , and . Recently, Banerjee et al. [ESA 2024] presented an algorithm which, for a constant , solves SegLCS in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Machine Learning in Bioinformatics · Network Packet Processing and Optimization
