Reinforcing Semantic-Symmetry for Document Summarization

Mingyang Song; Liping Jing

arXiv:2112.07583·cs.CL·April 1, 2022

Reinforcing Semantic-Symmetry for Document Summarization

Mingyang Song, Liping Jing

PDF

Open Access

TL;DR

This paper introduces ReSyM, a reinforcement learning-based model that explicitly addresses semantic gaps in document summarization, leading to more semantically consistent summaries and outperforming state-of-the-art methods on benchmark datasets.

Contribution

ReSyM is the first to explicitly incorporate semantic consistency rewards for both extractor and abstractor modules in document summarization.

Findings

01

ReSyM outperforms baselines on CNN/Daily Mail and BigPatent datasets.

02

Semantic rewards improve summary semantic fidelity.

03

Enhanced sentence representation learning boosts performance.

Abstract

Document summarization condenses a long document into a short version with salient information and accurate semantic descriptions. The main issue is how to make the output summary semantically consistent with the input document. To reach this goal, recently, researchers have focused on supervised end-to-end hybrid approaches, which contain an extractor module and abstractor module. Among them, the extractor identifies the salient sentences from the input document, and the abstractor generates a summary from the salient sentences. This model successfully keeps the consistency between the generated summary and the reference summary via various strategies (e.g., reinforcement learning). There are two semantic gaps when training the hybrid model (one is between document and extracted sentences, and the other is between extracted sentences and summary). However, they are not explicitly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques