# DiCo-EXT: Diversity and Consistency-Guided Framework for Extractive Summarization

**Authors:** Yiming Wang, Jindong Zhang

PMC · DOI: 10.3390/e28010088 · Entropy · 2026-01-12

## TL;DR

This paper introduces DiCo-EXT, a new method for extractive summarization that reduces redundancy and improves diversity in summaries without sacrificing quality.

## Contribution

DiCo-EXT introduces a training framework with semantic consistency and diversity penalty loss terms for extractive summarization.

## Key findings

- DiCo-EXT reduces redundancy in summaries while maintaining ROUGE scores comparable to strong baselines.
- The method achieves higher lexical diversity on datasets like CNN/DailyMail, XSum, and WikiHow.
- The framework is fully differentiable and does not require additional heuristics or post-processing.

## Abstract

ROUGE is a common objective for extractive summarization because n-gram overlap aligns with sentence-level selection. However, models that focus only on ROUGE often choose sentences with similar content, and the resulting summaries contain redundant information. We propose DiCo-EXT, a training framework that integrates two new loss terms into a standard extractive model: a semantic consistency term and a diversity penalty. The consistency module encourages selected sentences to stay close to document-level meaning, and the diversity penalty reduces semantic overlap within the summary. Both components are fully differentiable and can be optimized together with the base loss, without extra heuristics or multi-stage post-processing. Experiments on CNN/DailyMail, XSum, and WikiHow show lower redundancy and higher lexical diversity, while ROUGE remains comparable to a strong baseline. These results indicate that simple training objectives can balance coverage and redundancy without increasing model size or architectural complexity.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12840116/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12840116/full.md

## References

40 references — full list in the complete paper: https://tomesphere.com/paper/PMC12840116/full.md

---
Source: https://tomesphere.com/paper/PMC12840116