MetaIE: Distilling a Meta Model from LLM for All Kinds of Information Extraction Tasks
Letian Peng, Zilong Wang, Feng Yao, Zihan Wang, Jingbo Shang

TL;DR
MetaIE introduces a small, distilled meta-model from LLMs that effectively generalizes across various information extraction tasks, enhancing few-shot learning performance.
Contribution
The paper presents a novel symbolic distillation framework to create a versatile meta-model for IE tasks, outperforming existing pre-training and distillation methods.
Findings
MetaIE improves few-shot IE performance across 13 datasets.
MetaIE outperforms models from pre-training, multi-task, and symbolic distillation.
Extensive analysis on dataset size and model architecture impacts.
Abstract
Information extraction (IE) is a fundamental area in natural language processing where prompting large language models (LLMs), even with in-context examples, cannot defeat small LMs tuned on very small IE datasets. We observe that IE tasks, such as named entity recognition and relation extraction, all focus on extracting important information, which can be formalized as a label-to-span matching. In this paper, we propose a novel framework MetaIE to build a small LM as meta-model by learning to extract "important information", i.e., the meta-understanding of IE, so that this meta-model can be adapted to all kind of IE tasks effectively and efficiently. Specifically, MetaIE obtains the small LM via a symbolic distillation from an LLM following the label-to-span scheme. We construct the distillation dataset via sampling sentences from language model pre-training datasets (e.g., OpenWebText…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Neural Networks and Applications · Natural Language Processing Techniques
MethodsFocus
