A Myhill-Nerode Theorem for Generalized Automata, with Applications to Pattern Matching and Compression
Nicola Cotumaccio

TL;DR
This paper extends the Myhill-Nerode theorem to generalized automata, providing a theoretical foundation and practical applications in pattern matching and data compression.
Contribution
It introduces a full Myhill-Nerode theorem for generalized automata and demonstrates applications to efficient pattern matching and compression.
Findings
Established a full Myhill-Nerode theorem for generalized automata.
Showed Wheeler generalized automata can be stored efficiently.
Pattern matching queries can be performed in logarithmic time.
Abstract
The model of generalized automata, introduced by Eilenberg in 1974, allows representing a regular language more concisely than conventional automata by allowing edges to be labeled not only with characters, but also strings. Giammarresi and Montalbano introduced a notion of determinism for generalized automata [STACS 1995]. While generalized deterministic automata retain many properties of conventional deterministic automata, the uniqueness of a minimal generalized deterministic automaton is lost. In the first part of the paper, we show that the lack of uniqueness can be explained by introducing a set associated with a generalized automaton . In this way, we derive for the first time a full Myhill-Nerode theorem for generalized automata, which contains the textbook Myhill-Nerode theorem for conventional automata as a degenerate case. In the second…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
