Low-Resource Neural Headline Generation

Ottokar Tilk; Tanel Alum\"ae

arXiv:1707.09769·cs.CL·August 1, 2017

Low-Resource Neural Headline Generation

Ottokar Tilk, Tanel Alum\"ae

PDF

TL;DR

This paper introduces pretraining methods for neural headline generation models that significantly improve performance on small datasets by leveraging all available text and training all model parameters.

Contribution

It presents novel pretraining techniques that enable training all parameters and utilize all text, leading to substantial performance gains on low-resource datasets.

Findings

01

Up to 32.4% reduction in perplexity

02

ROUGE score improvement of 2.84 points

03

Effective pretraining on small datasets

Abstract

Recent neural headline generation models have shown great results, but are generally trained on very large datasets. We focus our efforts on improving headline quality on smaller datasets by the means of pretraining. We propose new methods that enable pre-training all the parameters of the model and utilize all available text, resulting in improvements by up to 32.4% relative in perplexity and 2.84 points in ROUGE.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.