End-to-End Relation Extraction using LSTMs on Sequences and Tree Structures
Makoto Miwa, Mohit Bansal

TL;DR
This paper introduces a novel neural network model combining sequence and tree-structured LSTMs for end-to-end relation extraction, achieving significant improvements over previous methods on benchmark datasets.
Contribution
The authors propose a unified LSTM-based model that jointly captures entity and relation information using sequence and dependency tree structures, with training strategies to enhance entity detection.
Findings
Achieved 12.1% and 5.7% relative error reductions in F1-score on ACE datasets.
Outperformed state-of-the-art feature-based models in end-to-end relation extraction.
Compared favorably to CNN-based models on nominal relation classification.
Abstract
We present a novel end-to-end neural model to extract entities and relations between them. Our recurrent neural network based model captures both word sequence and dependency tree substructure information by stacking bidirectional tree-structured LSTM-RNNs on bidirectional sequential LSTM-RNNs. This allows our model to jointly represent both entities and relations with shared parameters in a single model. We further encourage detection of entities during training and use of entity information in relation extraction via entity pretraining and scheduled sampling. Our model improves over the state-of-the-art feature-based model on end-to-end relation extraction, achieving 12.1% and 5.7% relative error reductions in F1-score on ACE2005 and ACE2004, respectively. We also show that our LSTM-RNN based model compares favorably to the state-of-the-art CNN based model (in F1-score) on nominal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
