Minimal Unique Substrings and Minimal Absent Words in a Sliding Window
Takuya Mieno, Yuki Kuhara, Tooru Akagi, Yuta Fujishige, Yuto, Nakashima, Shunsuke Inenaga, Hideo Bannai, Masayuki Takeda

TL;DR
This paper investigates algorithms for efficiently computing minimal unique substrings and minimal absent words within a sliding window over a string, providing new bounds and an improved algorithm for MUSs.
Contribution
It introduces an $O(n ext{log}\sigma)$-time algorithm for MUSs in sliding windows and tight bounds on MAWs changes, improving previous results.
Findings
Developed an $O(n ext{log}\sigma)$-time algorithm for MUSs in sliding windows.
Established tight bounds on the maximum changes in MAWs in sliding windows.
Improved upon previous bounds for MAWs changes in sliding window scenarios.
Abstract
A substring of a string is called a minimal unique substring (MUS) of if occurs exactly once in and any proper substring of occurs at least twice in . A string is called a minimal absent word (MAW) of if does not occur in and any proper substring of occurs in . In this paper, we study the problems of computing MUSs and MAWs in a sliding window over a given string . We first show how the set of MUSs can change in a sliding window over , and present an -time and -space algorithm to compute MUSs in a sliding window of width over , where is the maximum number of distinct characters in every window. We then give tight upper and lower bounds on the maximum number of changes in the set of MAWs in a sliding window over . Our bounds improve on the previous results in [Crochemore et al., 2017].
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · DNA and Biological Computing · semigroups and automata theory
