Co-lexicographically Ordering Automata and Regular Languages -- Part II
Nicola Cotumaccio, Giovanna D'Agostino, Alberto Policriti, Nicola, Prezza

TL;DR
This paper investigates the hierarchy of p-sortability in regular languages, establishing bounds and providing a polynomial-time algorithm to optimally index NFAs based on their width, enhancing efficiency in regular language indexing.
Contribution
It introduces a strict hierarchy of p-sortable languages, proves bounds relating NFAs and DFAs, and presents a polynomial-time algorithm for optimal NFA indexing without computing width.
Findings
Hierarchy of p-sortable languages is strict and non-collapsing.
Bounds relate minimal widths of NFAs and DFAs exponentially.
Polynomial-time algorithm indexes NFAs optimally based on width.
Abstract
In the present work, we tackle the regular language indexing problem by first studying the hierarchy of -sortable languages: regular languages accepted by automata of width . We show that the hierarchy is strict and does not collapse, and provide (exponential in ) upper and lower bounds relating the minimum widths of equivalent NFAs and DFAs. Our bounds indicate the importance of being able to index NFAs, as they enable indexing regular languages with much faster and smaller indexes. Our second contribution solves precisely this problem, optimally: we devise a polynomial-time algorithm that indexes any NFA with the optimal value for its width, without explicitly computing (NP-hard to find). In particular, this implies that we can index in polynomial time the well-studied case (Wheeler NFAs). More in general, in polynomial time we can build an index breaking the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topicssemigroups and automata theory · Algorithms and Data Compression · DNA and Biological Computing
