BERT has a Mouth, and It Must Speak: BERT as a Markov Random Field Language Model

Alex Wang; Kyunghyun Cho

arXiv:1902.04094·cs.CL·April 11, 2019·141 cites



TL;DR

This paper shows that BERT is a Markov random field language model, which gives a natural procedure for sampling sentences from it. Compared with a traditional left-to-right language model, BERT's generations are more diverse but of slightly worse quality.

Contribution

The paper formulates BERT as a Markov random field language model, which yields a natural procedure for sampling sentences from BERT and for comparing the quality and diversity of its generations with those of a left-to-right language model.

Findings

01

Sentences can be sampled from BERT by treating it as a Markov random field language model.

02

BERT's generated sentences are more diverse than those of a traditional left-to-right language model.

03

The generated sentences are of slightly worse quality than the left-to-right model's, but remain high-quality and fluent.

Abstract

We show that BERT (Devlin et al., 2018) is a Markov random field language model. This formulation gives way to a natural procedure to sample sentences from BERT. We generate from BERT and find that it can produce high-quality, fluent generations. Compared to the generations of a traditional left-to-right language model, BERT generates sentences that are more diverse but of slightly worse quality.
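
The abstract does not spell out the sampling procedure. As a rough illustration of how one might draw a sentence from a model that only exposes per-position conditionals, here is a minimal Gibbs-style sketch in Python. Everything in it is an assumption made for illustration: `toy_conditional` is a hypothetical stand-in for a masked language model such as BERT, the vocabulary is invented, and the sweep schedule is a choice made here rather than the authors' method.

```python
import random

# Invented toy vocabulary; a real masked LM would use its own (e.g. WordPiece) vocabulary.
VOCAB = ["the", "cat", "dog", "sat", "ran", "quietly"]
MASK = "[MASK]"


def toy_conditional(tokens, position):
    """Hypothetical stand-in for a masked LM.

    Returns unnormalized weights over VOCAB for the token at `position`,
    given the other tokens. It only discourages repeating the left neighbour.
    """
    left = tokens[position - 1] if position > 0 else None
    return [0.1 if word == left else 1.0 for word in VOCAB]


def gibbs_sample(length, n_sweeps, seed=0):
    """Start from an all-mask sequence and resample one position at a time."""
    rng = random.Random(seed)
    tokens = [MASK] * length
    for _ in range(n_sweeps):
        positions = list(range(length))
        rng.shuffle(positions)  # visit every position once per sweep, in random order
        for pos in positions:
            tokens[pos] = MASK  # hide the current token before resampling it
            weights = toy_conditional(tokens, pos)
            tokens[pos] = rng.choices(VOCAB, weights=weights, k=1)[0]
    return tokens


if __name__ == "__main__":
    print(" ".join(gibbs_sample(length=5, n_sweeps=3, seed=1)))
```

With a real model, `toy_conditional` would be replaced by a forward pass that returns the predicted distribution at the masked position. Visiting every position once per sweep simply guarantees that no mask token remains after the first sweep.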


Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis

Methods: Linear Layer · Residual Connection · Attention Dropout · Linear Warmup With Linear Decay · Weight Decay · Dense Connections · Adam · WordPiece · Softmax