TL;DR
This paper introduces a novel model for biological relation extraction that predicts relationships across all mention pairs in a document simultaneously, leveraging self-attention to improve accuracy and efficiency, especially in weakly labeled settings.
Contribution
The authors propose a self-attention based model that predicts all mention pair relationships at once and effectively handles weakly labeled data, advancing biological relation extraction.
Findings
Achieved state-of-the-art results on Biocreative V dataset.
Introduced a new large-scale biological relation dataset.
Demonstrated effectiveness without external knowledge bases.
Abstract
Most work in relation extraction forms a prediction by looking at a short span of text within a single sentence containing a single entity pair mention. This approach often does not consider interactions across mentions, requires redundant computation for each mention pair, and ignores relationships expressed across sentence boundaries. These problems are exacerbated by the document- (rather than sentence-) level annotation common in biological text. In response, we propose a model which simultaneously predicts relationships between all mention pairs in a document. We form pairwise predictions over entire paper abstracts using an efficient self-attention encoder. All-pairs mention scores allow us to perform multi-instance learning by aggregating over mentions to form entity pair representations. We further adapt to settings without mention-level annotation by jointly training to predict…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
