Differentiable Perturb-and-Parse: Semi-Supervised Parsing with a Structured Variational Autoencoder
Caio Corro, Ivan Titov

TL;DR
This paper introduces a semi-supervised dependency parsing method that leverages unlabeled text using a differentiable relaxation of structured variational autoencoders, improving parsing accuracy across multiple languages.
Contribution
It presents a novel differentiable perturb-and-parse model for semi-supervised parsing, enabling gradient-based learning with intractable structured latent variables.
Findings
Effective on English, French, and Swedish datasets.
Outperforms existing semi-supervised parsing methods.
Demonstrates the utility of differentiable dynamic programming in structured models.
Abstract
Human annotation for syntactic parsing is expensive, and large resources are available only for a fraction of languages. A question we ask is whether one can leverage abundant unlabeled texts to improve syntactic parsers, beyond just using the texts to obtain more generalisable lexical features (i.e. beyond word embeddings). To this end, we propose a novel latent-variable generative model for semi-supervised syntactic dependency parsing. As exact inference is intractable, we introduce a differentiable relaxation to obtain approximate samples and compute gradients with respect to the parser parameters. Our method (Differentiable Perturb-and-Parse) relies on differentiable dynamic programming over stochastically perturbed edge scores. We demonstrate effectiveness of our approach with experiments on English, French and Swedish.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis
