Soft Layer-Specific Multi-Task Summarization with Entailment and Question Generation
Han Guo, Ramakanth Pasunuru, Mohit Bansal

TL;DR
This paper introduces a multi-task learning approach for abstractive summarization that incorporates question generation and entailment tasks, using novel layer-specific sharing mechanisms to improve summary quality and logical consistency.
Contribution
It proposes a new multi-task architecture with high-level layer-specific sharing and soft-sharing, enhancing summarization by integrating entailment and question generation tasks.
Findings
Significant improvements over state-of-the-art on CNN/DailyMail and Gigaword datasets.
Demonstrated better saliency detection and entailment skills in summaries.
Achieved strong transfer performance on DUC-2002 dataset.
Abstract
An accurate abstractive summary of a document should contain all its salient information and should be logically entailed by the input document. We improve these important aspects of abstractive summarization via multi-task learning with the auxiliary tasks of question generation and entailment generation, where the former teaches the summarization model how to look for salient questioning-worthy details, and the latter teaches the model how to rewrite a summary which is a directed-logical subset of the input document. We also propose novel multi-task architectures with high-level (semantic) layer-specific sharing across multiple encoder and decoder layers of the three tasks, as well as soft-sharing mechanisms (and show performance ablations and analysis examples of each contribution). Overall, we achieve statistically significant improvements over the state-of-the-art on both the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
