Towards Semi-Supervised Learning for Deep Semantic Role Labeling

Sanket Vaibhav Mehta; Jay Yoon Lee; Jaime Carbonell

arXiv:1808.09543·cs.CL·August 30, 2018

Towards Semi-Supervised Learning for Deep Semantic Role Labeling

Sanket Vaibhav Mehta, Jay Yoon Lee, Jaime Carbonell

PDF

TL;DR

This paper introduces a semi-supervised approach for deep semantic role labeling that leverages syntactic constraints and unlabeled data, significantly improving performance in low-resource settings.

Contribution

It proposes a novel semi-supervised training method with syntactic-inconsistency loss, enhancing SRL accuracy with limited labeled data.

Findings

01

Achieves +1.58 F1 with 1% labeled data

02

Improves +0.78 F1 with 10% labeled data

03

Enhances inference F1 by +3.67 and +2.1 with 1% and 10% data

Abstract

Neural models have shown several state-of-the-art performances on Semantic Role Labeling (SRL). However, the neural models require an immense amount of semantic-role corpora and are thus not well suited for low-resource languages or domains. The paper proposes a semi-supervised semantic role labeling method that outperforms the state-of-the-art in limited SRL training corpora. The method is based on explicitly enforcing syntactic constraints by augmenting the training objective with a syntactic-inconsistency loss component and uses SRL-unlabeled instances to train a joint-objective LSTM. On CoNLL-2012 English section, the proposed semi-supervised training with 1%, 10% SRL-labeled data and varying amounts of SRL-unlabeled data achieves +1.58, +0.78 F1, respectively, over the pre-trained models that were trained on SOTA architecture with ELMo on the same SRL-labeled data. Additionally, by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Bidirectional LSTM · Softmax · ELMo