Self-Training for Unsupervised Parsing with PRPN

Anhad Mohananey; Katharina Kann; Samuel R. Bowman

arXiv:2005.13455·cs.CL·May 28, 2020

Self-Training for Unsupervised Parsing with PRPN

Anhad Mohananey, Katharina Kann, Samuel R. Bowman

PDF

TL;DR

This paper introduces a self-training approach for neural unsupervised parsing using the PRPN model, significantly improving parsing accuracy and enabling semi-supervised learning in low-resource scenarios.

Contribution

It extends the PRPN architecture for semi-supervised training and demonstrates substantial performance gains over previous methods.

Findings

01

Self-training improves F1 score by 8.1%.

02

Model outperforms previous state-of-the-art by 1.6% F1.

03

Effective in ultra-low-resource semi-supervised settings.

Abstract

Neural unsupervised parsing (UP) models learn to parse without access to syntactic annotations, while being optimized for another task like language modeling. In this work, we propose self-training for neural UP models: we leverage aggregated annotations predicted by copies of our model as supervision for future copies. To be able to use our model's predictions during training, we extend a recent neural UP architecture, the PRPN (Shen et al., 2018a) such that it can be trained in a semi-supervised fashion. We then add examples with parses predicted by our model to our unlabeled UP training data. Our self-trained model outperforms the PRPN by 8.1% F1 and the previous state of the art by 1.6% F1. In addition, we show that our architecture can also be helpful for semi-supervised parsing in ultra-low-resource settings.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.