Benchmarking Large Language Models with Augmented Instructions for   Fine-grained Information Extraction

Jun Gao; Huan Zhao; Yice Zhang; Wei Wang; Changlong Yu; Ruifeng Xu

arXiv:2310.05092·cs.CL·October 10, 2023·2 cites

Benchmarking Large Language Models with Augmented Instructions for Fine-grained Information Extraction

Jun Gao, Huan Zhao, Yice Zhang, Wei Wang, Changlong Yu, Ruifeng Xu

PDF

Open Access

TL;DR

This paper presents a new benchmark dataset and evaluation framework for fine-grained information extraction using large language models, emphasizing augmented instructions and analyzing model performance and generalization capabilities.

Contribution

It introduces a fine-grained IE benchmark with augmented instructions for LLMs and evaluates different models' abilities to generalize to unseen information types.

Findings

01

Encoder-decoder models like T5 perform well on unseen types.

02

ChatGPT shows high adaptability to new task forms.

03

Model performance depends on architecture, data diversity, and learning techniques.

Abstract

Information Extraction (IE) is an essential task in Natural Language Processing. Traditional methods have relied on coarse-grained extraction with simple instructions. However, with the emergence of Large Language Models (LLMs), there is a need to adapt IE techniques to leverage the capabilities of these models. This paper introduces a fine-grained IE benchmark dataset tailored for LLMs, employing augmented instructions for each information type, which includes task descriptions, extraction rules, output formats, and examples. Through extensive evaluations, we observe that encoder-decoder models, particularly T5 and FLAN-T5, perform well in generalizing to unseen information types, while ChatGPT exhibits greater adaptability to new task forms. Our results also indicate that performance is not solely dictated by model scale, and highlight the significance of architecture, data diversity,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Linear Layer · Dropout · Multi-Head Attention · Byte Pair Encoding · Attention Is All You Need · SentencePiece · Gated Linear Unit · Attention Dropout · Dense Connections