On Prefix Normal Words and Prefix Normal Forms
P\'eter Burcsi, Gabriele Fici, Zsuzsanna Lipt\'ak, Frank Ruskey, Joe, Sawada

TL;DR
This paper studies prefix normal words, characterizes their relation to Parikh vectors, analyzes their language complexity, provides enumeration bounds, and explores their properties and open problems.
Contribution
It introduces a characterization of binary words with the same factor Parikh vectors using prefix normal words and analyzes their language and enumeration properties.
Findings
The language of prefix normal words is not context-free.
Bounds on the number of prefix normal words of length n are established.
The generating function for fixed density prefix normal words is rational.
Abstract
A -prefix normal word is a binary word with the property that no factor has more s than the prefix of the same length; a -prefix normal word is defined analogously. These words arise in the context of indexed binary jumbled pattern matching, where the aim is to decide whether a word has a factor with a given number of s and s (a given Parikh vector). Each binary word has an associated set of Parikh vectors of the factors of the word. Using prefix normal words, we provide a characterization of the equivalence class of binary words having the same set of Parikh vectors of their factors. We prove that the language of prefix normal words is not context-free and is strictly contained in the language of pre-necklaces, which are prefixes of powers of Lyndon words. We give enumeration results on , the number of prefix normal words of length , showing that,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
