V-Words, Lyndon Words and Galois Words
Jacqueline W. Daykin, Neerja Mhaskar, and W. F. Smyth

TL;DR
This paper explores the structure of special string families called circ-UMFFs, their connection to Lyndon words and V-order, and extends these concepts to general total orders, with applications in text processing.
Contribution
It introduces the substring circ-UMFF, generalizes V-order to any total order, and investigates their combinatorial properties and applications.
Findings
Defined substring circ-UMFF and extended V-order concepts.
Established connections between Lyndon words and V-order.
Proposed applications in text indexing and compression.
Abstract
We say that a family of strings over forms a Unique Maximal Factorization Family (UMFF) if and only if every has a unique maximal factorization. Further, an UMFF is called a circ-UMFF whenever it contains exactly one rotation of every primitive string . -order is a non-lexicographical total ordering on strings that determines a circ-UMFF. In this paper we propose a generalization of circ-UMFF called the substring circ-UMFF and extend combinatorial research on -order by investigating connections to Lyndon words. Then we extend these concepts to any total order. Applications of this research arise in efficient text indexing, compression, and search problems.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLogic, programming, and type systems
