Transductive Learning for Abstractive News Summarization
Arthur Bra\v{z}inskas, Mengwen Liu, Ramesh Nallapati, Sujith Ravi,, Markus Dreyer

TL;DR
This paper introduces a transductive learning approach for abstractive news summarization that fine-tunes models on test inputs, leading to improved ROUGE scores and more coherent, abstract summaries without altering the original architecture.
Contribution
It is the first to apply transductive learning to summarization, enhancing performance by fine-tuning on test set inputs and pseudo summaries, with state-of-the-art results.
Findings
Improved ROUGE-L scores on CNN/DM and NYT datasets.
Enhanced summary coherence and abstractiveness.
Effective transductive fine-tuning without architectural changes.
Abstract
Pre-trained and fine-tuned news summarizers are expected to generalize to news articles unseen in the fine-tuning (training) phase. However, these articles often contain specifics, such as new events and people, a summarizer could not learn about in training. This applies to scenarios such as a news publisher training a summarizer on dated news and summarizing incoming recent news. In this work, we explore the first application of transductive learning to summarization where we further fine-tune models on test set inputs. Specifically, we construct pseudo summaries from salient article sentences and input randomly masked articles. Moreover, this approach is also beneficial in the fine-tuning phase, where we jointly predict extractive pseudo references and abstractive gold summaries in the training set. We show that our approach yields state-of-the-art results on CNN/DM and NYT datasets,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
