Simultaneously Self-Attending to All Mentions for Full-Abstract   Biological Relation Extraction

Patrick Verga; Emma Strubell; Andrew McCallum

arXiv:1802.10569·cs.CL·March 1, 2018

Simultaneously Self-Attending to All Mentions for Full-Abstract Biological Relation Extraction

Patrick Verga, Emma Strubell, Andrew McCallum

PDF

1 Repo

TL;DR

This paper introduces a novel model for biological relation extraction that predicts relationships across all mention pairs in a document simultaneously, leveraging self-attention to improve accuracy and efficiency, especially in weakly labeled settings.

Contribution

The authors propose a self-attention based model that predicts all mention pair relationships at once and effectively handles weakly labeled data, advancing biological relation extraction.

Findings

01

Achieved state-of-the-art results on Biocreative V dataset.

02

Introduced a new large-scale biological relation dataset.

03

Demonstrated effectiveness without external knowledge bases.

Abstract

Most work in relation extraction forms a prediction by looking at a short span of text within a single sentence containing a single entity pair mention. This approach often does not consider interactions across mentions, requires redundant computation for each mention pair, and ignores relationships expressed across sentence boundaries. These problems are exacerbated by the document- (rather than sentence-) level annotation common in biological text. In response, we propose a model which simultaneously predicts relationships between all mention pairs in a document. We form pairwise predictions over entire paper abstracts using an efficient self-attention encoder. All-pairs mention scores allow us to perform multi-instance learning by aggregating over mentions to form entity pair representations. We further adapt to settings without mention-level annotation by jointly training to predict…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

patverga/bran
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.