Ordering regular languages: a danger zone
Giovanna D'Agostino, Davide Martincigh, Alberto Policriti

TL;DR
This paper explores Wheeler automata and languages, showing their computational advantages and limitations, and establishing complexity bounds for related decision problems in automata theory.
Contribution
It provides upper bounds for problem complexity in Wheeler automata and demonstrates hardness results when generalizing beyond Wheeler constraints.
Findings
Polynomial-time algorithms for Wheeler automata problems
Deciding Wheeler automaton membership is polynomial
Generalizations lead to NP-complete or PSPACE-complete problems
Abstract
Ordering the collection of states of a given automaton starting from an order of the underlying alphabet is a natural move towards a computational treatment of the language accepted by the automaton. Along this path, Wheeler \emph{graphs} have been recently introduced as an extension/adaptation of the Burrows-Wheeler Transform (the now famous BWT, originally defined on strings) to graphs. These graphs constitute an important data-structure for languages, since they allow a very efficient storage mechanism for the transition function of an automaton, while providing a fast support to all sorts of substring queries. This is possible as a consequence of a property -- the so-called \emph{path coherence} -- valid on Wheeler graphs and consisting in an ordering on nodes that "propagates" to (collections of) strings. By looking at a Wheeler graph as an automaton, the ordering on strings…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · semigroups and automata theory · DNA and Biological Computing
