Meta In-Context Learning Makes Large Language Models Better Zero and   Few-Shot Relation Extractors

Guozheng Li; Peng Wang; Jiajun Liu; Yikai Guo; Ke Ji; Ziyu Shang,; Zijie Xu

arXiv:2404.17807·cs.CL·April 30, 2024·1 cites

Meta In-Context Learning Makes Large Language Models Better Zero and Few-Shot Relation Extractors

Guozheng Li, Peng Wang, Jiajun Liu, Yikai Guo, Ke Ji, Ziyu Shang,, Zijie Xu

PDF

Open Access

TL;DR

This paper introduces extsc{Micre}, a meta-training framework that enhances large language models' zero and few-shot relation extraction capabilities by teaching them to learn in context across diverse datasets, significantly improving performance.

Contribution

The paper proposes a novel meta-training approach for LLMs to improve zero and few-shot relation extraction without task-specific tuning at inference.

Findings

01

extsc{Micre} outperforms baseline methods on various RE datasets.

02

Performance gains are more significant with larger models.

03

Diverse meta-training datasets are crucial for effective transfer.

Abstract

Relation extraction (RE) is an important task that aims to identify the relationships between entities in texts. While large language models (LLMs) have revealed remarkable in-context learning (ICL) capability for general zero and few-shot learning, recent studies indicate that current LLMs still struggle with zero and few-shot RE. Previous studies are mainly dedicated to design prompt formats and select good examples for improving ICL-based RE. Although both factors are vital for ICL, if one can fundamentally boost the ICL capability of LLMs in RE, the zero and few-shot RE performance via ICL would be significantly improved. To this end, we introduce \textsc{Micre} (\textbf{M}eta \textbf{I}n-\textbf{C}ontext learning of LLMs for \textbf{R}elation \textbf{E}xtraction), a new meta-training framework for zero and few-shot RE where an LLM is tuned to do ICL on a diverse collection of RE…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Domain Adaptation and Few-Shot Learning

MethodsSparse Evolutionary Training