Gradient Imitation Reinforcement Learning for Low Resource Relation   Extraction

Xuming Hu; Chenwei Zhang; Yawen Yang; Xiaohe Li; Li Lin; Lijie Wen,; Philip S. Yu

arXiv:2109.06415·cs.CL·September 15, 2021

Gradient Imitation Reinforcement Learning for Low Resource Relation Extraction

Xuming Hu, Chenwei Zhang, Yawen Yang, Xiaohe Li, Li Lin, Lijie Wen,, Philip S. Yu

PDF

Open Access 1 Repo

TL;DR

This paper introduces GradLRE, a novel reinforcement learning approach that improves low-resource relation extraction by imitating gradient descent directions, effectively utilizing limited labeled data and data augmentation.

Contribution

The paper proposes a Gradient Imitation Reinforcement Learning framework for low-resource relation extraction, addressing feedback limitations and incorporating data augmentation for scenarios with no unlabeled data.

Findings

01

GradLRE outperforms baseline methods on two public datasets.

02

The method effectively handles both with and without unlabeled data scenarios.

03

Experimental results show significant improvement in low-resource settings.

Abstract

Low-resource Relation Extraction (LRE) aims to extract relation facts from limited labeled corpora when human annotation is scarce. Existing works either utilize self-training scheme to generate pseudo labels that will cause the gradual drift problem, or leverage meta-learning scheme which does not solicit feedback explicitly. To alleviate selection bias due to the lack of feedback loops in existing LRE learning paradigms, we developed a Gradient Imitation Reinforcement Learning method to encourage pseudo label data to imitate the gradient descent direction on labeled data and bootstrap its optimization capability through trial and error. We also propose a framework called GradLRE, which handles two major scenarios in low-resource relation extraction. Besides the scenario where unlabeled data is sufficient, GradLRE handles the situation where no unlabeled data is available, by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

thu-bpm/gradlre
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Software Engineering Research