Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot Relation Extractors
Kai Zhang, Bernal Jim\'enez Guti\'errez, Yu Su

TL;DR
This paper introduces QA4RE, a framework that aligns relation extraction with question answering tasks, significantly enhancing large language models' zero-shot performance on RE by leveraging common instruction-tuning datasets.
Contribution
The paper proposes QA4RE, a novel approach that improves zero-shot relation extraction in LLMs by aligning it with question answering, addressing the low representation of RE in instruction datasets.
Findings
QA4RE consistently improves LLM zero-shot RE performance
LLMs outperform strong zero-shot baselines with QA4RE
Framework shows robustness and transferability
Abstract
Recent work has shown that fine-tuning large language models (LLMs) on large-scale instruction-following datasets substantially improves their performance on a wide range of NLP tasks, especially in the zero-shot setting. However, even advanced instruction-tuned LLMs still fail to outperform small LMs on relation extraction (RE), a fundamental information extraction task. We hypothesize that instruction-tuning has been unable to elicit strong RE capabilities in LLMs due to RE's low incidence in instruction-tuning datasets, making up less than 1% of all tasks (Wang et al., 2022). To address this limitation, we propose QA4RE, a framework that aligns RE with question answering (QA), a predominant task in instruction-tuning datasets. Comprehensive zero-shot RE experiments over four datasets with two series of instruction-tuned LLMs (six LLMs in total) demonstrate that our QA4RE framework…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
Methodsfail
