Learning to Generate Reviews and Discovering Sentiment

Alec Radford; Rafal Jozefowicz; Ilya Sutskever

arXiv:1704.01444·cs.LG·April 7, 2017·351 cites

Learning to Generate Reviews and Discovering Sentiment

Alec Radford, Rafal Jozefowicz, Ilya Sutskever

PDF

Open Access 2 Repos 1 Video

TL;DR

This paper demonstrates that byte-level recurrent language models can learn disentangled high-level features like sentiment in an unsupervised manner, achieving state-of-the-art results with minimal labeled data and enabling sentiment-controlled text generation.

Contribution

It reveals that a single unit in unsupervised byte-level models can perform sentiment analysis and influence generation, advancing understanding of learned representations.

Findings

01

A single unit encodes sentiment in the model.

02

State-of-the-art sentiment classification with minimal labeled data.

03

Sentiment control in generated samples by fixing the sentiment unit.

Abstract

We explore the properties of byte-level recurrent language models. When given sufficient amounts of capacity, training data, and compute time, the representations learned by these models include disentangled features corresponding to high-level concepts. Specifically, we find a single unit which performs sentiment analysis. These representations, learned in an unsupervised manner, achieve state of the art on the binary subset of the Stanford Sentiment Treebank. They are also very data efficient. When using only a handful of labeled examples, our approach matches the performance of strong baselines trained on full datasets. We also demonstrate the sentiment unit has a direct influence on the generative process of the model. Simply fixing its value to be positive or negative generates samples with the corresponding positive or negative sentiment.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

AI Discovers Sentiment By Writing Amazon Reviews· youtube

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques