The Generation-Recognition Asymmetry: Six Dimensions of a Fundamental Divide in Formal Language Theory
Romain Peyrichou

TL;DR
This paper explores the fundamental asymmetries between generation and recognition in formal language theory across six dimensions, challenging common assumptions and connecting them to recent NLP developments.
Contribution
It identifies six key dimensions of the generation-recognition divide, introduces two new dimensions, and relates temporal asymmetry to surprisal, offering a comprehensive unified perspective.
Findings
Generation under constraints can be NP-hard.
Parsing is always constrained, generation need not be.
Bidirectional systems have existed for fifty years but are underused.
Abstract
Every formal grammar defines a language and can in principle be used in three ways: to generate strings (production), to recognize them (parsing), or -- given only examples -- to infer the grammar itself (grammar induction). Generation and recognition are extensionally equivalent -- they characterize the same set -- but operationally asymmetric in multiple independent ways. Inference is a qualitatively harder problem: it does not have access to a known grammar. Despite the centrality of this triad to compiler design, natural language processing, and formal language theory, no survey has treated it as a unified, multidimensional phenomenon. We identify six dimensions along which generation and recognition diverge: computational complexity, ambiguity, directionality, information availability, grammar inference, and temporality. We show that the common characterization "generation is easy,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Logic, programming, and type systems · Machine Learning and Algorithms
