A Simple but Effective Approach to Improve Structured Language Model   Output for Information Extraction

Yinghao Li; Rampi Ramprasad; Chao Zhang

arXiv:2402.13364·cs.CL·February 22, 2024·3 cites

A Simple but Effective Approach to Improve Structured Language Model Output for Information Extraction

Yinghao Li, Rampi Ramprasad, Chao Zhang

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper presents G&O, a simple two-step prompting method that significantly improves large language models' ability to generate structured text for information extraction tasks like NER and RE, with minimal extra effort.

Contribution

The paper introduces G&O, a novel two-step pipeline that separates content generation from structuring, enhancing LLM performance in structured text tasks.

Findings

01

Significant performance improvements in zero-shot NER and RE tasks.

02

G&O can be combined with other strategies for further gains.

03

Minimal additional effort required for implementation.

Abstract

Large language models (LLMs) have demonstrated impressive abilities in generating unstructured natural language according to instructions. However, their performance can be inconsistent when tasked with producing text that adheres to specific structured formats, which is crucial in applications like named entity recognition (NER) or relation extraction (RE). To address this issue, this paper introduces an efficient method, G&O, to enhance their structured text generation capabilities. It breaks the generation into a two-step pipeline: initially, LLMs generate answers in natural language as intermediate responses. Subsequently, LLMs are asked to organize the output into the desired structure, using the intermediate responses as context. G&O effectively separates the generation of content from the structuring process, reducing the pressure of completing two orthogonal tasks…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yinghao-li/gno-ie
noneOfficial

Videos

A Simple but Effective Approach to Improve Structured Language Model Output for Information Extraction· underline

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling