How to Unleash the Power of Large Language Models for Few-shot Relation Extraction?
Xin Xu, Yuqi Zhu, Xiaohan Wang, Ningyu Zhang

TL;DR
This paper explores how large language models like GPT-3.5 can be effectively used for few-shot relation extraction, proposing methods to improve performance and achieve state-of-the-art results.
Contribution
It systematically investigates in-context learning and data generation techniques, introducing task instructions and schema constraints to enhance few-shot relation extraction.
Findings
In-context learning performs comparably to prompt learning methods.
Data generation with large language models improves performance significantly.
Achieves new state-of-the-art results on four relation extraction datasets.
Abstract
Scaling language models have revolutionized widespread NLP tasks, yet little comprehensively explored few-shot relation extraction with large language models. In this paper, we investigate principal methodologies, in-context learning and data generation, for few-shot relation extraction via GPT-3.5 through exhaustive experiments. To enhance few-shot performance, we further propose task-related instructions and schema-constrained data generation. We observe that in-context learning can achieve performance on par with previous prompt learning approaches, and data generation with the large language model can boost previous solutions to obtain new state-of-the-art few-shot results on four widely-studied relation extraction datasets. We hope our work can inspire future research for the capabilities of large language models in few-shot relation extraction. Code is available in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
Methods15 Ways to Contact How can i speak to someone at Delta Airlines · Multi-Head Attention · Attention Is All You Need · Cosine Annealing · Adam · Layer Normalization · Linear Layer · Dropout · Byte Pair Encoding · Weight Decay
