How to Unleash the Power of Large Language Models for Few-shot Relation Extraction?

Xin Xu; Yuqi Zhu; Xiaohan Wang; Ningyu Zhang

arXiv:2305.01555·cs.CL·June 12, 2023·5 cites

TL;DR

This paper explores how large language models such as GPT-3.5 can be used for few-shot relation extraction through in-context learning and data generation, and reports new state-of-the-art few-shot results on four relation extraction datasets.

Contribution

It systematically investigates in-context learning and data generation techniques, introducing task instructions and schema constraints to enhance few-shot relation extraction.

Findings

01. In-context learning performs comparably to prompt learning methods.

02. Data generation with large language models improves performance significantly.

03. Achieves new state-of-the-art results on four relation extraction datasets.

Abstract

Scaling language models has revolutionized a wide range of NLP tasks, yet few-shot relation extraction with large language models has not been comprehensively explored. In this paper, we investigate two principal methodologies, in-context learning and data generation, for few-shot relation extraction with GPT-3.5 through exhaustive experiments. To enhance few-shot performance, we further propose task-related instructions and schema-constrained data generation. We observe that in-context learning can achieve performance on par with previous prompt learning approaches, and that data generation with the large language model can boost previous solutions to obtain new state-of-the-art few-shot results on four widely studied relation extraction datasets. We hope our work can inspire future research on the capabilities of large language models in few-shot relation extraction. Code is available in…
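The abstract names two prompt-side ideas, task-related instructions for in-context learning and schema-constrained data generation, but this page does not show the prompt formats the authors used. The sketch below is only one plausible way to assemble such prompts as plain strings. The `Example` class, both function names, the field labels ("Context:", "Head entity:", and so on), and the sample relation label are all assumptions made for illustration; no model API is called.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Example:
    """One labeled relation-extraction instance (hypothetical container)."""
    text: str
    head: str
    tail: str
    relation: str


def _format_demo(d: Example) -> List[str]:
    # Render one demonstration as four labeled lines plus a blank separator.
    return [
        f"Context: {d.text}",
        f"Head entity: {d.head}",
        f"Tail entity: {d.tail}",
        f"Relation: {d.relation}",
        "",
    ]


def build_icl_prompt(instruction: str, demos: List[Example], query_text: str,
                     head: str, tail: str, labels: List[str]) -> str:
    """In-context learning prompt: task instruction, label set, demos, then the query."""
    lines = [instruction, "Candidate relations: " + ", ".join(labels), ""]
    for d in demos:
        lines += _format_demo(d)
    # The query repeats the demo layout but leaves the relation for the model to fill in.
    lines += [
        f"Context: {query_text}",
        f"Head entity: {head}",
        f"Tail entity: {tail}",
        "Relation:",
    ]
    return "\n".join(lines)


def build_generation_prompt(relation: str, head_type: str, tail_type: str,
                            demos: List[Example], n_new: int) -> str:
    """Data-generation prompt for one relation, constrained by entity types (the assumed schema)."""
    lines = [
        f"Generate {n_new} new samples for the relation '{relation}'.",
        f"The head entity must be of type {head_type} and the tail entity of type {tail_type}.",
        "Follow the layout of the samples below.",
        "",
    ]
    for d in demos:
        if d.relation == relation:  # only show demonstrations of the target relation
            lines += _format_demo(d)
    lines.append("New samples:")
    return "\n".join(lines)


# Illustrative data; the sentence and the label are made up for this sketch.
demos = [Example("Steve Jobs co-founded Apple in 1976.", "Apple", "Steve Jobs", "org:founded_by")]
icl_prompt = build_icl_prompt(
    "Identify the relation between the head and tail entities in the context.",
    demos, "Ada works at Acme.", "Ada", "Acme",
    ["org:founded_by", "per:employee_of"],
)
gen_prompt = build_generation_prompt("org:founded_by", "ORGANIZATION", "PERSON", demos, 3)
```

In this layout, the in-context prompt would be sent to the model once per test instance, while the generation prompt would be sent once per relation type and its output parsed into extra training samples for a smaller model. Both usages are assumptions about how such prompts might be applied, not a description of the paper's pipeline.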

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Text Readability and Simplification

Methods: Multi-Head Attention · Attention Is All You Need · Cosine Annealing · Adam · Layer Normalization · Linear Layer · Dropout · Byte Pair Encoding · Weight Decay