Modern Methods for Text Generation

Dimas Munoz Montesinos

arXiv:2009.04968·cs.CL·September 11, 2020

Modern Methods for Text Generation

Dimas Munoz Montesinos

PDF

Open Access 2 Repos

TL;DR

This paper reviews modern Transformer-based models like BERT and GPT-2, highlighting their architecture and comparing their effectiveness in text generation tasks such as translation and summarization.

Contribution

It provides an analysis and comparison of BERT and GPT-2, emphasizing their performance in text generation and understanding of sequential data.

Findings

01

Transformers improve understanding of sequential data

02

BERT and GPT-2 excel in text classification and translation

03

Comparison shows differences in output quality for text generation

Abstract

Synthetic text generation is challenging and has limited success. Recently, a new architecture, called Transformers, allow machine learning models to understand better sequential data, such as translation or summarization. BERT and GPT-2, using Transformers in their cores, have shown a great performance in tasks such as text classification, translation and NLI tasks. In this article, we analyse both algorithms and compare their output quality in text generation tasks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques

MethodsLinear Layer · Cosine Annealing · Linear Warmup With Cosine Annealing · Byte Pair Encoding · Discriminative Fine-Tuning · GPT-2 · Layer Normalization · Weight Decay · Dropout · Linear Warmup With Linear Decay