Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot Relation Extractors

Kai Zhang; Bernal Jiménez Gutiérrez; Yu Su

arXiv:2305.11159·cs.CL·May 19, 2023·6 cites

TL;DR

This paper introduces QA4RE, a framework that recasts relation extraction (RE) as question answering (QA), a task that is well represented in common instruction-tuning datasets, and thereby significantly improves the zero-shot RE performance of large language models.

Contribution

The paper proposes QA4RE, an approach that improves zero-shot relation extraction in LLMs by aligning it with question answering, addressing the low representation of RE in instruction-tuning datasets.

Findings

01 QA4RE consistently improves LLM zero-shot RE performance
02 LLMs outperform strong zero-shot baselines with QA4RE
03 Framework shows robustness and transferability

Abstract

Recent work has shown that fine-tuning large language models (LLMs) on large-scale instruction-following datasets substantially improves their performance on a wide range of NLP tasks, especially in the zero-shot setting. However, even advanced instruction-tuned LLMs still fail to outperform small LMs on relation extraction (RE), a fundamental information extraction task. We hypothesize that instruction-tuning has been unable to elicit strong RE capabilities in LLMs due to RE's low incidence in instruction-tuning datasets, making up less than 1% of all tasks (Wang et al., 2022). To address this limitation, we propose QA4RE, a framework that aligns RE with question answering (QA), a predominant task in instruction-tuning datasets. Comprehensive zero-shot RE experiments over four datasets with two series of instruction-tuned LLMs (six LLMs in total) demonstrate that our QA4RE framework…
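
The core idea of aligning RE with QA can be illustrated with a short sketch. The code below is not taken from the paper or its repository; it is a minimal, assumed version of how a relation extraction instance might be rewritten as a multiple-choice question for an instruction-tuned LLM. The relation labels, the template wording, the prompt text, and the helper names `build_qa_prompt` and `parse_answer` are all hypothetical and chosen only for illustration.

```python
from string import ascii_uppercase

# Hypothetical relation templates (illustrative only, not the paper's templates).
# Each template verbalizes one candidate relation between a head and a tail entity.
TEMPLATES = {
    "founder_of": "{head} is the founder of {tail}.",
    "employee_of": "{head} is an employee of {tail}.",
    "no_relation": "{head} has no known relation to {tail}.",
}


def build_qa_prompt(sentence, head, tail, templates):
    """Turn one RE instance into a multiple-choice QA prompt.

    Returns the prompt string and a mapping from option letter to relation label.
    """
    letters = {}
    lines = [
        "Determine which option can be inferred from the given sentence.",
        f"Sentence: {sentence}",
        "Options:",
    ]
    # One lettered option per candidate relation, in template order.
    for letter, (relation, template) in zip(ascii_uppercase, templates.items()):
        letters[letter] = relation
        lines.append(f"{letter}. {template.format(head=head, tail=tail)}")
    lines.append("Which option can be inferred from the given sentence?")
    lines.append("Answer:")
    return "\n".join(lines), letters


def parse_answer(answer, letters, default="no_relation"):
    """Map the model's reply (e.g. 'A' or 'a. ...') back to a relation label."""
    choice = answer.strip()[:1].upper()
    return letters.get(choice, default)


prompt, letters = build_qa_prompt(
    "Bill Gates founded Microsoft in 1975.", "Bill Gates", "Microsoft", TEMPLATES
)
print(prompt)
```

In this sketch the LLM only has to return an option letter, which is the kind of output QA-style instruction data already trains it to produce; `parse_answer` then converts that letter back into a relation label, falling back to an assumed default when the reply cannot be matched.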

Code & Models

Repositories

osu-nlp-group/qa4re (Official)

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Text Readability and Simplification

Methodsfail