Co-lexicographically Ordering Automata and Regular Languages -- Part I
Nicola Cotumaccio, Giovanna D'Agostino, Alberto Policriti, Nicola, Prezza

TL;DR
This paper introduces a new measure called co-lex width for automata, showing it captures complexity and enables more efficient algorithms for automata problems, with implications for automata minimization and regular expression matching.
Contribution
It defines co-lex width as a complexity measure for automata, linking it to automata size, encoding, and matching efficiency, and establishes foundational theorems for co-lexicographically ordered languages.
Findings
Co-lex width captures automata complexity and relates to automata size and encoding.
PSPACE-hard problems become fixed-parameter tractable in co-lex width.
Regular expression matching complexity bounds are affected by co-lex width.
Abstract
In the present work, we lay out a new theory showing that all automata can always be co-lexicographically partially ordered, and an intrinsic measure of their complexity can be defined and effectively determined, namely, the minimum width of one of their admissible co-lex partial orders - dubbed here the automaton's co-lex width. We first show that this new measure captures at once the complexity of several seemingly-unrelated hard problems on automata. Any NFA of co-lex width : (i) has an equivalent powerset DFA whose size is exponential in rather than (as a classic analysis shows) in the NFA's size; (ii) can be encoded using just bits per transition; (iii) admits a linear-space data structure solving regular expression matching queries in time proportional to per matched character. Some consequences of this new parametrization of automata are that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topicssemigroups and automata theory · Network Packet Processing and Optimization · DNA and Biological Computing
