End-to-End Information Extraction without Token-Level Supervision

Rasmus Berg Palm; Dirk Hovy; Florian Laws; Ole Winther

arXiv:1707.04913·cs.CL·July 18, 2017

End-to-End Information Extraction without Token-Level Supervision

Rasmus Berg Palm, Dirk Hovy, Florian Laws, Ole Winther

PDF

1 Repo

TL;DR

This paper introduces an end-to-end information extraction model that learns directly from raw text and output pairs, eliminating the need for costly token-level labels, and demonstrates competitive results on multiple datasets.

Contribution

It presents a novel pointer network-based E2E IE model trained without token-level supervision, expanding applicability to tasks lacking detailed annotations.

Findings

01

Achieves results within a few percentage points of token-supervised baselines.

02

Demonstrates feasibility of E2E IE without token-level labels.

03

Opens new possibilities for tasks with only raw input-output data.

Abstract

Most state-of-the-art information extraction approaches rely on token-level labels to find the areas of interest in text. Unfortunately, these labels are time-consuming and costly to create, and consequently, not available for many real-life IE tasks. To make matters worse, token-level labels are usually not the desired output, but just an intermediary step. End-to-end (E2E) models, which take raw text as input and produce the desired output directly, need not depend on token-level labels. We propose an E2E model based on pointer networks, which can be trained directly on pairs of raw input and output text. We evaluate our model on the ATIS data set, MIT restaurant corpus and the MIT movie corpus and compare to neural baselines that do use token-level labels. We achieve competitive results, within a few percentage points of the baselines, showing the feasibility of E2E information…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rasmusbergpalm/e2e-ie-release
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.