Fine-tune Bert for DocRED with Two-step Process

Hong Wang; Christfried Focke; Rob Sylvester; Nilesh Mishra; William; Wang

arXiv:1909.11898·cs.CL·September 27, 2019·116 cites

Fine-tune Bert for DocRED with Two-step Process

Hong Wang, Christfried Focke, Rob Sylvester, Nilesh Mishra, William, Wang

PDF

Open Access 1 Repo

TL;DR

This paper enhances document-level relation extraction by fine-tuning BERT in a two-step process, significantly improving over simple BiLSTM baselines on the DocRED dataset.

Contribution

It introduces a two-phase fine-tuning approach for BERT, improving relation prediction accuracy in document-level relation extraction tasks.

Findings

01

BERT-based models outperform BiLSTM baselines.

02

Two-step process improves relation classification accuracy.

03

Method achieves state-of-the-art results on DocRED.

Abstract

Modelling relations between multiple entities has attracted increasing attention recently, and a new dataset called DocRED has been collected in order to accelerate the research on the document-level relation extraction. Current baselines for this task uses BiLSTM to encode the whole document and are trained from scratch. We argue that such simple baselines are not strong enough to model to complex interaction between entities. In this paper, we further apply a pre-trained language model (BERT) to provide a stronger baseline for this task. We also find that solving this task in phases can further improve the performance. The first step is to predict whether or not two entities have a relation, the second step is to predict the specific relation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hongwang600/DocRed
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Bidirectional LSTM