A Myhill-Nerode Theorem for Generalized Automata, with Applications to Pattern Matching and Compression

Nicola Cotumaccio

arXiv:2302.06506·cs.FL·April 23, 2026

TL;DR

This paper extends the Myhill-Nerode theorem to generalized automata, whose edges may be labeled with strings rather than single characters, and applies the result to pattern matching and data compression.

Contribution

It introduces a full Myhill-Nerode theorem for generalized automata and demonstrates applications to efficient pattern matching and compression.

Findings

01

Established a full Myhill-Nerode theorem for generalized automata.

02

Showed Wheeler generalized automata can be stored efficiently.

03

Pattern matching queries can be performed in logarithmic time.

Abstract

The model of generalized automata, introduced by Eilenberg in 1974, allows representing a regular language more concisely than conventional automata by allowing edges to be labeled not only with characters, but also strings. Giammarresi and Montalbano introduced a notion of determinism for generalized automata [STACS 1995]. While generalized deterministic automata retain many properties of conventional deterministic automata, the uniqueness of a minimal generalized deterministic automaton is lost. In the first part of the paper, we show that the lack of uniqueness can be explained by introducing a set $W(A)$ associated with a generalized automaton $A$. In this way, we derive for the first time a full Myhill-Nerode theorem for generalized automata, which contains the textbook Myhill-Nerode theorem for conventional automata as a degenerate case. In the second…
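
To make the model in the abstract concrete, the following is a minimal sketch of how a generalized automaton might be represented and how acceptance could be checked. It is an illustration only, not the paper's construction: the function name `accepts`, the dictionary layout of `EDGES`, the example language (ab)*c, and the assumption that every edge label is a non-empty string are all hypothetical choices made for this example.

```python
from collections import deque


def accepts(edges, initial, finals, word):
    """Return True if some path from `initial` to a state in `finals` spells `word`.

    `edges` maps a state to a list of (label, target) pairs. Each label is a
    string, assumed non-empty in this sketch (hypothetical representation).
    """
    # Breadth-first search over configurations (state, characters consumed so far).
    start = (initial, 0)
    seen = {start}
    queue = deque([start])
    while queue:
        state, pos = queue.popleft()
        if pos == len(word) and state in finals:
            return True
        for label, target in edges.get(state, []):
            # An edge can be followed only if its whole label matches
            # the next characters of the word.
            if word.startswith(label, pos):
                nxt = (target, pos + len(label))
                if nxt not in seen:
                    seen.add(nxt)
                    queue.append(nxt)
    return False


# Hypothetical two-state generalized automaton for the language (ab)*c:
# state 0 has a self-loop labeled "ab" and an edge labeled "c" to state 1.
EDGES = {0: [("ab", 0), ("c", 1)]}

print(accepts(EDGES, 0, {1}, "ababc"))
print(accepts(EDGES, 0, {1}, "abac"))
```

In this example the string label "ab" lets two states suffice, whereas a conventional automaton for the same language needs an additional intermediate state to read a and b one character at a time.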
