Towards Realistic Low-resource Relation Extraction: A Benchmark with   Empirical Baseline Study

Xin Xu; Xiang Chen; Ningyu Zhang; Xin Xie; Xi Chen; Huajun Chen

arXiv:2210.10678·cs.CL·September 19, 2023

Towards Realistic Low-resource Relation Extraction: A Benchmark with Empirical Baseline Study

Xin Xu, Xiang Chen, Ningyu Zhang, Xin Xie, Xi Chen, Huajun Chen

PDF

Open Access 2 Repos

TL;DR

This paper conducts an empirical study on low-resource relation extraction using pre-trained language models, evaluating prompt methods, balancing techniques, and data augmentation across a diverse benchmark of 8 datasets.

Contribution

It introduces a comprehensive benchmark for low-resource relation extraction and systematically compares various schemes to identify effective strategies.

Findings

01

Prompt-based tuning shows potential but needs improvement.

02

Balancing methods are not always effective for long-tailed data.

03

Data augmentation significantly improves performance, self-training less so.

Abstract

This paper presents an empirical study to build relation extraction systems in low-resource settings. Based upon recent pre-trained language models, we comprehensively investigate three schemes to evaluate the performance in low-resource settings: (i) different types of prompt-based methods with few-shot labeled data; (ii) diverse balancing methods to address the long-tailed distribution issue; (iii) data augmentation technologies and self-training to generate more labeled in-domain data. We create a benchmark with 8 relation extraction (RE) datasets covering different languages, domains and contexts and perform extensive comparisons over the proposed schemes with combinations. Our experiments illustrate: (i) Though prompt-based tuning is beneficial in low-resource RE, there is still much potential for improvement, especially in extracting relations from cross-sentence contexts with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification