Unsupervised Learning of Sentence Representations Using Sequence   Consistency

Siddhartha Brahma

arXiv:1808.04217·cs.CL·January 25, 2019·6 cites

Unsupervised Learning of Sentence Representations Using Sequence Consistency

Siddhartha Brahma

PDF

Open Access

TL;DR

This paper introduces ConsSent, an unsupervised method for learning sentence representations by enforcing sequence consistency constraints, leading to improved performance on various NLP tasks.

Contribution

It proposes a novel unsupervised approach that uses sequence consistency constraints and perturbation-based training to learn effective sentence encoders.

Findings

01

ConsSent outperforms strong unsupervised and supervised baselines.

02

Multitask training and ensemble methods further improve results.

03

The approach is effective across multiple transfer and linguistic tasks.

Abstract

Computing universal distributed representations of sentences is a fundamental task in natural language processing. We propose ConsSent, a simple yet surprisingly powerful unsupervised method to learn such representations by enforcing consistency constraints on sequences of tokens. We consider two classes of such constraints -- sequences that form a sentence and between two sequences that form a sentence when merged. We learn sentence encoders by training them to distinguish between consistent and inconsistent examples, the latter being generated by randomly perturbing consistent examples in six different ways. Extensive evaluation on several transfer learning and linguistic probing tasks shows improved performance over strong unsupervised and supervised baselines, substantially surpassing them in several cases. Our best results are achieved by training sentence encoders in a multitask…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Sentiment Analysis and Opinion Mining