Counting Lyndon Subsequences
Ryo Hirakawa, Yuto Nakashima, Shunsuke Inenaga, Masayuki Takeda

TL;DR
This paper investigates the enumeration of Lyndon subsequences within strings, establishing maximum and expected counts, and analyzing the diversity of such subsequences to deepen understanding in string combinatorics.
Contribution
It introduces the problem of counting Lyndon subsequences and provides results on maximum, expected, and distinct counts, advancing the theoretical understanding of Lyndon structures.
Findings
Maximum total number of Lyndon subsequences in a string
Expected total number of Lyndon subsequences in a string
Expected number of distinct Lyndon subsequences
Abstract
Counting substrings/subsequences that preserve some property (e.g., palindromes, squares) is an important mathematical interest in stringology. Recently, Glen et al. studied the number of Lyndon factors in a string. A string is called a Lyndon word if it is the lexicographically smallest among all of its conjugates . In this paper, we consider a more general problem "counting Lyndon subsequences". We show (1) the maximum total number of Lyndon subsequences in a string, (2) the expected total number of Lyndon subsequences in a string, (3) the expected number of distinct Lyndon subsequences in a string.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Natural Language Processing Techniques · semigroups and automata theory
