Differentiable Perturb-and-Parse: Semi-Supervised Parsing with a   Structured Variational Autoencoder

Caio Corro; Ivan Titov

arXiv:1807.09875·cs.CL·February 22, 2019·44 cites

Differentiable Perturb-and-Parse: Semi-Supervised Parsing with a Structured Variational Autoencoder

Caio Corro, Ivan Titov

PDF

Open Access

TL;DR

This paper introduces a semi-supervised dependency parsing method that leverages unlabeled text using a differentiable relaxation of structured variational autoencoders, improving parsing accuracy across multiple languages.

Contribution

It presents a novel differentiable perturb-and-parse model for semi-supervised parsing, enabling gradient-based learning with intractable structured latent variables.

Findings

01

Effective on English, French, and Swedish datasets.

02

Outperforms existing semi-supervised parsing methods.

03

Demonstrates the utility of differentiable dynamic programming in structured models.

Abstract

Human annotation for syntactic parsing is expensive, and large resources are available only for a fraction of languages. A question we ask is whether one can leverage abundant unlabeled texts to improve syntactic parsers, beyond just using the texts to obtain more generalisable lexical features (i.e. beyond word embeddings). To this end, we propose a novel latent-variable generative model for semi-supervised syntactic dependency parsing. As exact inference is intractable, we introduce a differentiable relaxation to obtain approximate samples and compute gradients with respect to the parser parameters. Our method (Differentiable Perturb-and-Parse) relies on differentiable dynamic programming over stochastically perturbed edge scores. We demonstrate effectiveness of our approach with experiments on English, French and Swedish.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis