Generating CCG Categories
Yufang Liu, Tao Ji, Yuanbin Wu, Man Lan

TL;DR
This paper introduces a category generation approach for CCG supertagging, decomposing categories into atomic sequences to improve robustness and performance, achieving state-of-the-art results on standard benchmarks.
Contribution
It proposes a novel sequence generation method for CCG categories, capturing internal structures and sharing annotations, leading to improved accuracy and robustness.
Findings
Achieved 95.5% accuracy in supertagging
Attained 89.8% labeled F1 in parsing
Performed well on infrequent and out-of-domain categories
Abstract
Previous CCG supertaggers usually predict categories using multi-class classification. Despite their simplicity, internal structures of categories are usually ignored. The rich semantics inside these structures may help us to better handle relations among categories and bring more robustness into existing supertaggers. In this work, we propose to generate categories rather than classify them: each category is decomposed into a sequence of smaller atomic tags, and the tagger aims to generate the correct sequence. We show that with this finer view on categories, annotations of different categories could be shared and interactions with sentence contexts could be enhanced. The proposed category generator is able to achieve state-of-the-art tagging (95.5% accuracy) and parsing (89.8% labeled F1) performances on the standard CCGBank. Furthermore, its performances on infrequent (even unseen)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
