Importance of Copying Mechanism for News Headline Generation

Ilya Gusev

arXiv:1904.11475·cs.CL·May 6, 2020·5 cites

Importance of Copying Mechanism for News Headline Generation

Ilya Gusev

PDF

Open Access 1 Repo

TL;DR

This paper demonstrates that incorporating a copying mechanism in news headline generation models significantly improves performance, especially in morphologically rich languages like Russian, by effectively handling new named entities.

Contribution

It validates that models with copying mechanisms outperform those without, achieving higher ROUGE scores and better results on Russian news datasets.

Findings

01

Copying mechanism improves ROUGE scores by 8 points.

02

Model outperforms existing models on Russian news datasets.

03

Copying mechanism effectively handles new named entities.

Abstract

News headline generation is an essential problem of text summarization because it is constrained, well-defined, and is still hard to solve. Models with a limited vocabulary can not solve it well, as new named entities can appear regularly in the news and these entities often should be in the headline. News articles in morphologically rich languages such as Russian require model modifications due to a large number of possible word forms. This study aims to validate that models with a possibility of copying words from the original article performs better than models without such an option. The proposed model achieves a mean ROUGE score of 23 on the provided test dataset, which is 8 points greater than the result of a similar model without a copying mechanism. Moreover, the resulting model performs better than any known model on the new dataset of Russian news.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

IlyaGusev/summarus
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques