StructVAE: Tree-structured Latent Variable Models for Semi-supervised   Semantic Parsing

Pengcheng Yin; Chunting Zhou; Junxian He; Graham Neubig

arXiv:1806.07832·cs.CL·June 21, 2018

StructVAE: Tree-structured Latent Variable Models for Semi-supervised Semantic Parsing

Pengcheng Yin, Chunting Zhou, Junxian He, Graham Neubig

PDF

Open Access 5 Repos

TL;DR

StructVAE is a semi-supervised model that leverages unlabeled data to improve semantic parsing by modeling tree-structured latent variables, outperforming supervised models on ATIS and Python code tasks.

Contribution

Introduces StructVAE, a novel tree-structured variational auto-encoder for semi-supervised semantic parsing utilizing unlabeled data.

Findings

01

Outperforms supervised models with additional unlabeled data

02

Effective on ATIS domain and Python code generation

03

Models tree-structured latent variables for meaning representations

Abstract

Semantic parsing is the task of transducing natural language (NL) utterances into formal meaning representations (MRs), commonly represented as tree structures. Annotating NL utterances with their corresponding MRs is expensive and time-consuming, and thus the limited availability of labeled data often becomes the bottleneck of data-driven, supervised models. We introduce StructVAE, a variational auto-encoding model for semisupervised semantic parsing, which learns both from limited amounts of parallel data, and readily-available unlabeled NL utterances. StructVAE models latent MRs not observed in the unlabeled data as tree-structured latent variables. Experiments on semantic parsing on the ATIS domain and Python code generation show that with extra unlabeled data, StructVAE outperforms strong supervised models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification