Linear Algorithms for Computing the Lyndon Border Array and the Lyndon Suffix Array
Ali Alatabbi, Jacqueline W. Daykin, M. Sohel Rahman

TL;DR
This paper introduces linear-time algorithms for computing the Lyndon Border Array and Lyndon Suffix Array, which are new structures combining border and Lyndon word concepts to analyze string patterns efficiently.
Contribution
It presents the first linear-time algorithms for constructing Lyndon Border Array and Lyndon Suffix Array, advancing pattern analysis in strings with these novel structures.
Findings
Linear algorithms for Lyndon Border Array computation.
Linear algorithms for Lyndon Suffix Array construction.
Efficient analysis of string repetitive structures.
Abstract
We consider the problem of finding repetitive structures and inherent patterns in a given string of length over a finite totally ordered alphabet. A border of a string is both a prefix and a suffix of such that . The computation of the border array of a string , namely the borders of each prefix of , is strongly related to the string matching problem: given a string , find all of its occurrences in . A {\itshape Lyndon word} is a primitive word (i.e., it is not a power of another word) which is minimal for the lexicographical order of its conjugacy class (i.e., the set of words obtained by cyclic rotations of the letters). In this paper we combine these concepts to introduce the \emph{Lyndon Border Array} of , whose -th entry is the length of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · DNA and Biological Computing · Natural Language Processing Techniques
