Countering the Effects of Lead Bias in News Summarization via Multi-Stage Training and Auxiliary Losses
Matt Grenander, Yue Dong, Jackie Chi Kit Cheung, Annie Louis

TL;DR
This paper addresses lead bias in news summarization by introducing multi-stage training and auxiliary losses, improving model sensitivity to content importance throughout articles.
Contribution
It proposes two novel techniques—pretraining with unbiased shuffled data and an auxiliary ROUGE loss—to reduce lead bias in neural summarization systems.
Findings
Significant performance improvements over baseline models.
Auxiliary ROUGE loss outperforms pretraining alone.
Enhanced content importance distribution in summaries.
Abstract
Sentence position is a strong feature for news summarization, since the lead often (but not always) summarizes the key points of the article. In this paper, we show that recent neural systems excessively exploit this trend, which although powerful for many inputs, is also detrimental when summarizing documents where important content should be extracted from later parts of the article. We propose two techniques to make systems sensitive to the importance of content in different parts of the article. The first technique employs 'unbiased' data; i.e., randomly shuffled sentences of the source document, to pretrain the model. The second technique uses an auxiliary ROUGE-based loss that encourages the model to distribute importance scores throughout a document by mimicking sentence-level ROUGE scores on the training data. We show that these techniques significantly improve the performance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
