A General Framework for Information Extraction using Dynamic Span Graphs

Yi Luan; Dave Wadden; Luheng He; Amy Shah; Mari Ostendorf; Hannaneh Hajishirzi

arXiv:1904.03296·cs.CL·April 9, 2019·37 cites

TL;DR

This paper presents a versatile framework for information extraction that uses dynamically built span graphs to improve entity, relation, and coreference detection, outperforming previous methods across various datasets.

Contribution

The authors propose a novel dynamic span graph approach that propagates confidence scores to refine span representations, enhancing multi-task information extraction performance.

Findings

01. Significant outperformance of state-of-the-art methods on multiple datasets.

02. Effective detection of nested span entities with improved F1 scores.

03. Dynamic propagation of confidence scores improves span representation quality.
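
The nested-entity finding follows from how span enumeration works: every contiguous token span up to some width is a candidate, so a short span and a longer span that contains it can both be scored. The sketch below only illustrates that idea. The function name `enumerate_spans`, the `max_width` parameter, and the example sentence are assumptions made for illustration and are not taken from the paper's code.

```python
from typing import List, Tuple


def enumerate_spans(tokens: List[str], max_width: int) -> List[Tuple[int, int]]:
    """Return every (start, end) token span, end inclusive, of at most max_width tokens."""
    spans = []
    for start in range(len(tokens)):
        # The last valid end index is limited by both max_width and the sentence length.
        for end in range(start, min(start + max_width, len(tokens))):
            spans.append((start, end))
    return spans


# Hypothetical example sentence.
tokens = ["University", "of", "Washington", "campus"]
spans = enumerate_spans(tokens, max_width=3)

# Both the outer span and a span nested inside it are candidates.
outer = (0, 2)   # "University of Washington"
inner = (2, 2)   # "Washington"
print(outer in spans, inner in spans)
```

Because candidates are enumerated rather than tagged token by token, overlapping spans do not compete for the same label slot, which is one plausible reading of why the approach handles nesting well.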

Abstract

We introduce a general framework for several information extraction tasks that share span representations using dynamically constructed span graphs. The graphs are constructed by selecting the most confident entity spans and linking these nodes with confidence-weighted relation types and coreferences. The dynamic span graph allows coreference and relation type confidences to propagate through the graph to iteratively refine the span representations. This is unlike previous multi-task frameworks for information extraction in which the only interaction between tasks is in the shared first-layer LSTM. Our framework significantly outperforms the state-of-the-art on multiple information extraction tasks across multiple datasets reflecting different domains. We further observe that the span enumeration approach is good at detecting nested span entities, with significant F1 score improvement…
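
The two steps the abstract describes, keeping the most confident spans as graph nodes and letting edge confidences refine their representations, can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the names `top_k_spans` and `propagate`, the softmax normalization of edge scores, and the fixed scalar `gate` (standing in for whatever learned gating the model uses) are all hypothetical.

```python
import math
from typing import List


def top_k_spans(scores: List[float], k: int) -> List[int]:
    """Indices of the k highest-scoring spans (the graph nodes), in ascending index order."""
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:k])


def softmax(xs: List[float]) -> List[float]:
    """Numerically stable softmax over a list of scores."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def propagate(reps: List[List[float]],
              edge_scores: List[List[float]],
              gate: float = 0.5) -> List[List[float]]:
    """One refinement step over the span graph.

    Each node mixes its own vector with a confidence-weighted average of all
    node vectors. `gate` is an assumed fixed mixing weight for illustration.
    """
    n, dim = len(reps), len(reps[0])
    refined = []
    for i in range(n):
        weights = softmax(edge_scores[i])  # edge confidences from node i
        update = [sum(weights[j] * reps[j][d] for j in range(n)) for d in range(dim)]
        refined.append([gate * reps[i][d] + (1.0 - gate) * update[d] for d in range(dim)])
    return refined


# Hypothetical span confidences: keep the two most confident spans as nodes.
nodes = top_k_spans([0.1, 0.9, 0.4, 0.8], k=2)

# Two toy node vectors with uniform edge scores.
reps = [[1.0, 0.0], [0.0, 1.0]]
edge_scores = [[0.0, 0.0], [0.0, 0.0]]
refined = propagate(reps, edge_scores)
print(nodes, refined)
```

Calling `propagate` repeatedly on its own output corresponds to the iterative refinement the abstract mentions; with each call, information from confidently linked spans moves further through the graph.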

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Data Quality and Management

Methods: Sigmoid Activation · Tanh Activation · Long Short-Term Memory