Neural Headline Generation with Sentence-wise Optimization

Ayana; Shiqi Shen; Yu Zhao; Zhiyuan Liu; Maosong Sun

arXiv:1604.01904·cs.CL·October 11, 2016·39 cites

Neural Headline Generation with Sentence-wise Optimization

Ayana, Shiqi Shen, Yu Zhao, Zhiyuan Liu, Maosong Sun

PDF

Open Access

TL;DR

This paper introduces a sentence-level optimization approach for neural headline generation using minimum risk training, leading to significant performance improvements over existing models on English and Chinese datasets.

Contribution

It proposes a novel sentence-level training method with minimum risk training for neural headline generation, surpassing traditional word-level maximum likelihood approaches.

Findings

01

Outperforms state-of-the-art systems on English headline generation

02

Achieves significant improvements on Chinese headline tasks

03

Demonstrates effectiveness of sentence-level optimization

Abstract

Recently, neural models have been proposed for headline generation by learning to map documents to headlines with recurrent neural networks. Nevertheless, as traditional neural network utilizes maximum likelihood estimation for parameter optimization, it essentially constrains the expected training objective within word level rather than sentence level. Moreover, the performance of model prediction significantly relies on training data distribution. To overcome these drawbacks, we employ minimum risk training strategy in this paper, which directly optimizes model parameters in sentence level with respect to evaluation metrics and leads to significant improvements for headline generation. Experiment results show that our models outperforms state-of-the-art systems on both English and Chinese headline generation tasks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications