Generating CCG Categories

Yufang Liu; Tao Ji; Yuanbin Wu; Man Lan

arXiv:2103.08139 · cs.CL · March 16, 2021 · 1 citation

TL;DR

This paper introduces a category generation approach to CCG supertagging: each category is decomposed into a sequence of atomic tags that the tagger generates, which improves robustness and achieves state-of-the-art tagging and parsing results on the standard CCGBank.

Contribution

The paper proposes generating CCG categories as sequences of atomic tags instead of classifying them as whole labels. This exposes the internal structure of each category and lets annotations be shared across categories, improving accuracy and robustness.

Findings

01 Achieved 95.5% accuracy in supertagging

02 Attained 89.8% labeled F1 in parsing

03 Performed well on infrequent and out-of-domain categories

Abstract

Previous CCG supertaggers usually predict categories using multi-class classification. Despite their simplicity, internal structures of categories are usually ignored. The rich semantics inside these structures may help us to better handle relations among categories and bring more robustness into existing supertaggers. In this work, we propose to generate categories rather than classify them: each category is decomposed into a sequence of smaller atomic tags, and the tagger aims to generate the correct sequence. We show that with this finer view on categories, annotations of different categories could be shared and interactions with sentence contexts could be enhanced. The proposed category generator is able to achieve state-of-the-art tagging (95.5% accuracy) and parsing (89.8% labeled F1) performances on the standard CCGBank. Furthermore, its performances on infrequent (even unseen)…
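To make the decomposition idea concrete, here is a minimal sketch of splitting a CCG category string into atomic tags. The tokenization (slashes, parentheses, and atomic categories with their features, such as `S[dcl]`) and the function names `decompose` and `compose` are assumptions for illustration; the paper's actual decomposition scheme may differ.

```python
import re
from typing import List

# Hypothetical tokenizer: each slash or parenthesis is one tag, and any other
# run of characters (e.g. "NP" or "S[dcl]") is one atomic-category tag.
_TOKEN = re.compile(r"[()\\/]|[^()\\/]+")


def decompose(category: str) -> List[str]:
    """Split a CCG category string into a sequence of atomic tags."""
    return _TOKEN.findall(category)


def compose(tags: List[str]) -> str:
    """Rebuild the category string from its atomic tags."""
    return "".join(tags)


transitive = r"(S[dcl]\NP)/NP"   # e.g. a transitive verb
intransitive = r"S[dcl]\NP"      # e.g. an intransitive verb

print(decompose(transitive))
print(decompose(intransitive))

# Tags that both categories have in common: a classifier treats the two
# categories as unrelated labels, while a generator reuses these tags.
shared = set(decompose(transitive)) & set(decompose(intransitive))
print(sorted(shared))
```

Under this view, a rare category such as a long modifier type is still built from tags that occur often in frequent categories, which is one way to read the claim that annotations can be shared across categories.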

Code & Models

Repositories

Yufang-Liu/category-generator (Official)

Videos

Generating CCG Categories · Underline

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Text Readability and Simplification