Absent Subsequences in Words
Maria Kosche, Tore Ko{\ss}, Florin Manea, Stefan Siemer

TL;DR
This paper investigates the properties and algorithms related to absent subsequences in words, focusing on minimal and shortest absent subsequences, with combinatorial characterizations and efficient computational methods.
Contribution
It introduces new characterizations, representations, and algorithms for identifying and computing minimal and shortest absent subsequences in strings.
Findings
Provided combinatorial characterizations of minimal and shortest absent subsequences.
Developed efficient algorithms for testing and computing these absent subsequences.
Designed data structures for quick queries on shortest absent subsequences within string factors.
Abstract
An absent factor of a string is a string which does not occur as a contiguous substring (a.k.a. factor) inside . We extend this well-studied notion and define absent subsequences: a string is an absent subsequence of a string if does not occur as subsequence (a.k.a. scattered factor) inside . Of particular interest to us are minimal absent subsequences, i.e., absent subsequences whose every subsequence is not absent, and shortest absent subsequences, i.e., absent subsequences of minimal length. We show a series of combinatorial and algorithmic results regarding these two notions. For instance: we give combinatorial characterisations of the sets of minimal and, respectively, shortest absent subsequences in a word, as well as compact representations of these sets; we show how we can test efficiently if a string is a shortest or minimal absent subsequence in a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
