ROUGE 2.0: Updated and Improved Measures for Evaluation of Summarization Tasks

Kavita Ganesan

arXiv:1803.01937 · cs.IR · March 7, 2018 · 106 cites

TL;DR

ROUGE 2.0 introduces enhanced evaluation metrics for summarization that better capture semantic similarity and topic coverage, addressing limitations of the original ROUGE measures.

Contribution

The paper presents ROUGE 2.0, a set of improved evaluation measures that incorporate synonyms and topic coverage for more accurate summary assessment.

Findings

01 ROUGE 2.0 measures better reflect summary quality.

02 Enhanced metrics capture semantic and topical coverage.

03 Improved correlation with human judgment.

Abstract

Evaluation of summarization tasks is crucial to determining the quality of machine-generated summaries. Over the last decade, ROUGE has become the standard automatic evaluation measure for summarization tasks. While ROUGE has been shown to be effective in capturing n-gram overlap between system and human-composed summaries, the existing ROUGE measures have several limitations in capturing synonymous concepts and coverage of topics. Thus, ROUGE scores often do not reflect the true quality of summaries, and they prevent multi-faceted evaluation of summaries (i.e., by topic, by overall content coverage, etc.). In this paper, we introduce ROUGE 2.0, which has several updated measures of ROUGE: ROUGE-N+Synonyms, ROUGE-Topic, ROUGE-Topic+Synonyms, ROUGE-TopicUniq and ROUGE-TopicUniq+Synonyms, all of which are improvements over the core ROUGE measures.
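
The n-gram overlap that ROUGE measures, and the effect of counting synonyms as matches, can be illustrated with a minimal Python sketch. This is not the paper's implementation: the function names (`ngrams`, `normalize`, `rouge_n`) and the tiny hand-written synonym dictionary are assumptions made for illustration, and mapping each word to a single canonical synonym is just one simple way to approximate the idea behind ROUGE-N+Synonyms.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return the list of n-grams (as tuples) in a token sequence."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def normalize(tokens, synonyms):
    """Map each token to a canonical form using a hypothetical synonym dict."""
    return [synonyms.get(tok, tok) for tok in tokens]


def rouge_n(system, reference, n=1, synonyms=None):
    """Return (precision, recall, F1) of n-gram overlap between two summaries.

    If `synonyms` is given, tokens are mapped to canonical forms first, so
    synonymous words count as matches (an assumed, simplified scheme).
    """
    synonyms = synonyms or {}
    sys_counts = Counter(ngrams(normalize(system.lower().split(), synonyms), n))
    ref_counts = Counter(ngrams(normalize(reference.lower().split(), synonyms), n))

    # Clipped overlap: each n-gram matches at most as often as it occurs in both.
    overlap = sum((sys_counts & ref_counts).values())
    sys_total = sum(sys_counts.values())
    ref_total = sum(ref_counts.values())

    precision = overlap / sys_total if sys_total else 0.0
    recall = overlap / ref_total if ref_total else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1


reference = "the screen is big"
system = "the display is large"
toy_synonyms = {"display": "screen", "large": "big"}  # illustrative only

print(rouge_n(system, reference, n=1))                         # (0.5, 0.5, 0.5)
print(rouge_n(system, reference, n=1, synonyms=toy_synonyms))  # (1.0, 1.0, 1.0)
```

In this toy example the two summaries say the same thing with different words, so plain unigram overlap gives only 0.5 while the synonym-aware variant gives a perfect score, which is the kind of gap the abstract describes.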

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Semantic Web and Ontologies