Linear Time Construction of Cover Suffix Tree and Applications
Jakub Radoszewski

TL;DR
This paper presents an optimal linear-time algorithm for constructing the Cover Suffix Tree of a string over an integer alphabet, enabling efficient computation of various quasiperiodic structures and pattern occurrences.
Contribution
It introduces a novel linear-time algorithm for constructing the Cover Suffix Tree, improving previous methods and facilitating faster computation of seeds and partial covers.
Findings
CST can be built in O(n) time for integer alphabets.
New characterization of overlapping occurrences using runs.
Efficient indexing for pattern occurrence reporting.
Abstract
The Cover Suffix Tree (CST) of a string is the suffix tree of with additional explicit nodes corresponding to halves of square substrings of . In the CST an explicit node corresponding to a substring of is annotated with two numbers: the number of non-overlapping consecutive occurrences of and the total number of positions in that are covered by occurrences of in . Kociumaka et al. (Algorithmica, 2015) have shown how to compute the CST of a length- string in time. We show how to compute the CST in time assuming that is over an integer alphabet. Kociumaka et al. (Algorithmica, 2015; Theor. Comput. Sci., 2018) have shown that knowing the CST of a length- string , one can compute a linear-sized representation of all seeds of as well as all shortest -partial covers and seeds in for a given in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
