# An adaptive fusion-based data augmentation method for abstract dialogue summarization

**Authors:** Weihao Li, Dan Jiang, Han Zhang, Kejing Xiao, Shaozhong Cao

PMC · DOI: 10.7717/peerj-cs.2845 · PeerJ Computer Science · 2025-04-18

## TL;DR

This paper introduces a new data augmentation method for dialogue summarization that improves model performance with less labeled data.

## Contribution

The novel adaptive fusion method (AAF) combines two augmentation techniques to enhance model training under resource constraints.

## Key findings

- The AAF method improves ROUGE scores on DialogSum and SAMSum datasets compared to baselines.
- The amount of augmented data significantly affects model performance in resource-limited settings.

## Abstract

The dialogue summarization is necessary for information retrieval, and the training of abstract dialogue summarization models heavily rely on large amounts of labeled data. However, manual summarization of long dialogue is labor-costing and time-consuming. To solve this problem, this article proposes a data augmentation method for dialogue summary based on adaptive augmentation fusion (AAF), integrating the strengths of both Minor Perturbation Augmentation (MPA) and Semantic Reconstructive Augmentation (SRA) to balance model learning effectiveness and generalization capabilities. We first integrated existing enhancement methods to address the problem of insufficient annotated data for dialogue summarization. The experimental results on both the DialogSum and SAMSum datasets demonstrate that the AAF method achieves significant improvements in ROUGE scores under resource-constrained conditions, outperforming baseline approaches. Furthermore, it was validated that the selection of the amount of augmented data has a significant impact on model training results under resource-constrained conditions. We have publically released our code at https://github.com/alolke/AAF.

## Full-text entities

- **Genes:** F3 (coagulation factor III, tissue factor) [NCBI Gene 2152] {aka CD142, TF, TFA}, NINL (ninein like) [NCBI Gene 22981] {aka NLP}
- **Diseases:** AAF (MESH:D018489)
- **Chemicals:** Compo (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12192768/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12192768/full.md

## References

40 references — full list in the complete paper: https://tomesphere.com/paper/PMC12192768/full.md

---
Source: https://tomesphere.com/paper/PMC12192768