Modern Methods for Text Generation
Dimas Munoz Montesinos

TL;DR
This paper reviews modern Transformer-based models like BERT and GPT-2, highlighting their architecture and comparing their effectiveness in text generation tasks such as translation and summarization.
Contribution
It provides an analysis and comparison of BERT and GPT-2, emphasizing their performance in text generation and understanding of sequential data.
Findings
Transformers improve understanding of sequential data
BERT and GPT-2 excel in text classification and translation
Comparison shows differences in output quality for text generation
Abstract
Synthetic text generation is challenging and has limited success. Recently, a new architecture, called Transformers, allow machine learning models to understand better sequential data, such as translation or summarization. BERT and GPT-2, using Transformers in their cores, have shown a great performance in tasks such as text classification, translation and NLI tasks. In this article, we analyse both algorithms and compare their output quality in text generation tasks.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
MethodsLinear Layer · Cosine Annealing · Linear Warmup With Cosine Annealing · Byte Pair Encoding · Discriminative Fine-Tuning · GPT-2 · Layer Normalization · Weight Decay · Dropout · Linear Warmup With Linear Decay
