Faster Algorithms for Shortest Unique or Absent Substrings
Panagiotis Charalampopoulos, Manal Mohamed, Solon P. Pissis, Hilde Verbeek, Wiktor Zuba

TL;DR
This paper introduces faster algorithms for finding shortest unique and absent substrings in strings, improving over traditional methods especially for small alphabets in the word RAM model.
Contribution
The authors develop $ ext{O}(n rac{ ext{log} \sigma}{ ext{sqrt}( ext{log} n)} )$-time algorithms for SUS and SAS, utilizing techniques like synchronizing sets and wavelet trees.
Findings
New algorithms outperform folklore solutions for small alphabets.
Algorithms reduce SUS and SAS computation to geometric problems.
Efficient construction of de Bruijn sequences enhances SAS computation.
Abstract
We revisit two well-known algorithmic problems on strings: computing a shortest unique substring (SUS) and a shortest absent substring (SAS) of a string of length . Both problems admit folklore -time solutions using the suffix tree of . However, for small alphabets, this complexity is not necessarily optimal in the word RAM model, where a string of length over alphabet can be stored in space and read in time. We present an -time algorithm for computing a SUS of . This algorithm decomposes the problem according to the length and the period of the sought substring and uses several tools and techniques, such as synchronizing sets, the analysis of runs, and wavelet trees, to reduce the computation of a SUS to a simple geometric…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
