Generative Entity Typing with Curriculum Learning

Siyu Yuan; Deqing Yang; Jiaqing Liang; Zhixu Li; Jinxi Liu; Jingyue; Huang; Yanghua Xiao

arXiv:2210.02914·cs.CL·October 19, 2022·1 cites

Generative Entity Typing with Curriculum Learning

Siyu Yuan, Deqing Yang, Jiaqing Liang, Zhixu Li, Jinxi Liu, Jingyue, Huang, Yanghua Xiao

PDF

Open Access 1 Repo

TL;DR

This paper introduces a generative entity typing approach using curriculum learning to improve type assignment, especially for long-tail and zero-shot types, outperforming existing models across multiple languages and tasks.

Contribution

It proposes a novel generative paradigm for entity typing combined with curriculum learning to handle data heterogeneity and improve type granularity.

Findings

01

Outperforms state-of-the-art entity typing models.

02

Effective in few-shot and zero-shot scenarios.

03

Applicable across multiple languages and downstream tasks.

Abstract

Entity typing aims to assign types to the entity mentions in given texts. The traditional classification-based entity typing paradigm has two unignorable drawbacks: 1) it fails to assign an entity to the types beyond the predefined type set, and 2) it can hardly handle few-shot and zero-shot situations where many long-tail types only have few or even no training instances. To overcome these drawbacks, we propose a novel generative entity typing (GET) paradigm: given a text with an entity mention, the multiple types for the role that the entity plays in the text are generated with a pre-trained language model (PLM). However, PLMs tend to generate coarse-grained types after fine-tuning upon the entity typing dataset. Besides, we only have heterogeneous training data consisting of a small portion of human-annotated data and a large portion of auto-generated but low-quality data. To tackle…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

siyuyuan/get
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies