Variational Sequential Labelers for Semi-Supervised Learning

Mingda Chen; Qingming Tang; Karen Livescu; Kevin Gimpel

arXiv:1906.09535·cs.CL·June 25, 2019·1 cites

Variational Sequential Labelers for Semi-Supervised Learning

Mingda Chen, Qingming Tang, Karen Livescu, Kevin Gimpel

PDF

Open Access 1 Repo

TL;DR

This paper presents a family of variational models for semi-supervised sequence labeling that combine generative and discriminative approaches, leveraging latent variables to improve performance on multiple datasets.

Contribution

It introduces a novel multitask variational framework with hierarchical latent variables for semi-supervised sequence labeling, outperforming standard baselines.

Findings

01

Models outperform standard baselines on 8 datasets.

02

Unlabeled data further improves model performance.

03

Hierarchical latent structures enhance label and word information integration.

Abstract

We introduce a family of multitask variational methods for semi-supervised sequence labeling. Our model family consists of a latent-variable generative model and a discriminative labeler. The generative models use latent variables to define the conditional probability of a word given its context, drawing inspiration from word prediction objectives commonly used in learning word embeddings. The labeler helps inject discriminative information into the latent space. We explore several latent variable configurations, including ones with hierarchical structure, which enables the model to account for both label-specific and word-specific information. Our models consistently outperform standard sequential baselines on 8 sequence labeling datasets, and improve further with unlabeled data.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mingdachen/vsl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis