Categorial grammars with unique category assignment
Maxim Vishnikin, Alexander Okhotin

TL;DR
This paper explores a special class of categorial grammars with unique lexical categories, demonstrating their ability to encode all context-free languages through homomorphic mappings, thus revealing their surprising expressive power.
Contribution
It introduces a subclass of categorial grammars in which each symbol is assigned exactly one category, and proves that, although this subclass cannot define even some regular languages, it can define a homomorphic encoding of every context-free language.
Findings
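Unique category assignment does restrict direct expressive power: some regular languages cannot be defined.
Every context-free language nevertheless has a homomorphic encoding defined by a grammar of this class.
Assigning a single category to each symbol eliminates ambiguity at the lexical level while retaining context-free power up to homomorphic encoding.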
Abstract
A categorial grammar assigns one of several syntactic categories to each symbol of the alphabet, and the category of a string is then deduced from the categories assigned to its symbols using two simple reduction rules. This paper investigates a special class of categorial grammars, in which only one category is assigned to each symbol, thus eliminating ambiguity on the lexical level (in linguistic terms, a unique part of speech is assigned to each word). While unrestricted categorial grammars are equivalent to the context-free grammars, the proposed subclass initially appears weak, as it cannot define even some regular languages. It is proved in the paper that it is actually powerful enough to define a homomorphic encoding of every context-free language, in the sense that for every context-free language over an alphabet there is a language over some alphabet …
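The mechanism described in the abstract (one category per symbol, then two reduction rules) can be illustrated with a small recognizer. This is a minimal sketch and not the paper's construction. It assumes the standard reduction rules of basic categorial grammars, written here as A/B followed by B reduces to A, and B followed by B\A reduces to A. The class names `Over` and `Under`, the functions `categories` and `accepts`, the target category `"s"`, and the toy lexicon are all hypothetical names chosen for illustration.

```python
from dataclasses import dataclass
from typing import Union


@dataclass(frozen=True)
class Over:
    """Category result/arg: expects a string of category arg on its right."""
    result: "Cat"
    arg: "Cat"


@dataclass(frozen=True)
class Under:
    """Category arg\\result: expects a string of category arg on its left."""
    arg: "Cat"
    result: "Cat"


# A category is either a primitive (a plain string) or a compound category.
Cat = Union[str, Over, Under]


def categories(word, lexicon):
    """Return every category derivable for the whole word (CYK-style table)."""
    n = len(word)
    # table[i][j] holds the categories derivable for the span word[i:j].
    table = [[set() for _ in range(n + 1)] for _ in range(n + 1)]
    for i, sym in enumerate(word):
        if sym in lexicon:
            # Unique category assignment: exactly one category per symbol.
            table[i][i + 1].add(lexicon[sym])
    for length in range(2, n + 1):
        for i in range(n - length + 1):
            j = i + length
            for k in range(i + 1, j):
                for left in table[i][k]:
                    for right in table[k][j]:
                        # Rule 1: A/B followed by B reduces to A.
                        if isinstance(left, Over) and left.arg == right:
                            table[i][j].add(left.result)
                        # Rule 2: B followed by B\A reduces to A.
                        if isinstance(right, Under) and right.arg == left:
                            table[i][j].add(right.result)
    return table[0][n]


def accepts(word, lexicon, target="s"):
    """A word is accepted if the target category can be deduced for it."""
    return target in categories(word, lexicon)


# Toy lexicon (an assumption for illustration): each word has one category.
LEXICON = {
    "alice": "np",
    "bob": "np",
    "likes": Over(Under("np", "s"), "np"),  # (np\s)/np, a transitive verb
}
```

With this lexicon, `accepts(["alice", "likes", "bob"], LEXICON)` holds, because `likes bob` reduces to `np\s` and `alice` followed by `np\s` reduces to `s`; the reordered word `["likes", "alice", "bob"]` is rejected. Because the lexicon is a plain dictionary from symbols to single categories, lexical ambiguity is excluded by construction, which is the restriction the paper studies.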
Taxonomy
Topics: Natural Language Processing Techniques · semigroups and automata theory · Semantic Web and Ontologies
