Semi-supervised Autoencoding Projective Dependency Parsing

Xiao Zhang; Dan Goldwasser

arXiv:2011.00704·cs.CL·November 3, 2020

Semi-supervised Autoencoding Projective Dependency Parsing

Xiao Zhang, Dan Goldwasser

PDF

Open Access

TL;DR

This paper introduces two semi-supervised autoencoding models for projective dependency parsing that leverage unlabeled data to improve parsing accuracy, outperforming previous semi-supervised approaches.

Contribution

It proposes two novel autoencoding models, LAP and GAP, for semi-supervised dependency parsing, with exact inference and shared parameters for labeled and unlabeled data.

Findings

01

Models improve performance with limited labeled data.

02

Outperform previous semi-supervised models.

03

Effective use of unlabeled data in dependency parsing.

Abstract

We describe two end-to-end autoencoding models for semi-supervised graph-based projective dependency parsing. The first model is a Locally Autoencoding Parser (LAP) encoding the input using continuous latent variables in a sequential manner; The second model is a Globally Autoencoding Parser (GAP) encoding the input into dependency trees as latent variables, with exact inference. Both models consist of two parts: an encoder enhanced by deep neural networks (DNN) that can utilize the contextual information to encode the input into latent variables, and a decoder which is a generative model able to reconstruct the input. Both LAP and GAP admit a unified structure with different loss functions for labeled and unlabeled data with shared parameters. We conducted experiments on WSJ and UD dependency parsing data sets, showing that our models can exploit the unlabeled data to improve the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Genomics and Phylogenetic Studies