TL;DR
This paper introduces an end-to-end information extraction model that learns directly from raw text and output pairs, eliminating the need for costly token-level labels, and demonstrates competitive results on multiple datasets.
Contribution
It presents a novel pointer network-based E2E IE model trained without token-level supervision, expanding applicability to tasks lacking detailed annotations.
Findings
Achieves results within a few percentage points of token-supervised baselines.
Demonstrates feasibility of E2E IE without token-level labels.
Opens new possibilities for tasks with only raw input-output data.
Abstract
Most state-of-the-art information extraction approaches rely on token-level labels to find the areas of interest in text. Unfortunately, these labels are time-consuming and costly to create, and consequently, not available for many real-life IE tasks. To make matters worse, token-level labels are usually not the desired output, but just an intermediary step. End-to-end (E2E) models, which take raw text as input and produce the desired output directly, need not depend on token-level labels. We propose an E2E model based on pointer networks, which can be trained directly on pairs of raw input and output text. We evaluate our model on the ATIS data set, MIT restaurant corpus and the MIT movie corpus and compare to neural baselines that do use token-level labels. We achieve competitive results, within a few percentage points of the baselines, showing the feasibility of E2E information…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
