Wrap-Up: a Trainable Discourse Module for Information Extraction

S. Soderland; Lehnert. W

arXiv:cs/9412101·cs.AI·November 17, 2014

Wrap-Up: a Trainable Discourse Module for Information Extraction

S. Soderland, Lehnert. W

PDF

Open Access

TL;DR

This paper introduces Wrap-Up, a trainable discourse module for information extraction that automatically learns classifiers and features, enabling higher-level inference in unrestricted text with performance comparable to manually customized systems.

Contribution

The paper presents a fully trainable IE discourse component that automatically determines classifiers and features, advancing beyond previous limited, lower-level processing approaches.

Findings

01

Performance matches manually customized modules

02

Automatically derives classifiers and features

03

Enables higher-level inferences in IE systems

Abstract

The vast amounts of on-line text now available have led to renewed interest in information extraction (IE) systems that analyze unrestricted text, producing a structured representation of selected information from the text. This paper presents a novel approach that uses machine learning to acquire knowledge for some of the higher level IE processing. Wrap-Up is a trainable IE discourse component that makes intersentential inferences and identifies logical relations among information extracted from the text. Previous corpus-based approaches were limited to lower level processing such as part-of-speech tagging, lexical disambiguation, and dictionary construction. Wrap-Up is fully trainable, and not only automatically decides what classifiers are needed, but even derives the feature set for each classifier automatically. Performance equals that of a partially trainable discourse module…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Advanced Text Analysis Techniques