Reinforcing Semantic-Symmetry for Document Summarization
Mingyang Song, Liping Jing

TL;DR
This paper introduces ReSyM, a reinforcement learning-based model that explicitly addresses semantic gaps in document summarization, leading to more semantically consistent summaries and outperforming state-of-the-art methods on benchmark datasets.
Contribution
ReSyM is the first to explicitly incorporate semantic consistency rewards for both extractor and abstractor modules in document summarization.
Findings
ReSyM outperforms baselines on CNN/Daily Mail and BigPatent datasets.
Semantic rewards improve summary semantic fidelity.
Enhanced sentence representation learning boosts performance.
Abstract
Document summarization condenses a long document into a short version with salient information and accurate semantic descriptions. The main issue is how to make the output summary semantically consistent with the input document. To reach this goal, recently, researchers have focused on supervised end-to-end hybrid approaches, which contain an extractor module and abstractor module. Among them, the extractor identifies the salient sentences from the input document, and the abstractor generates a summary from the salient sentences. This model successfully keeps the consistency between the generated summary and the reference summary via various strategies (e.g., reinforcement learning). There are two semantic gaps when training the hybrid model (one is between document and extracted sentences, and the other is between extracted sentences and summary). However, they are not explicitly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
