Leveraging Non-dialogue Summaries for Dialogue Summarization

Seongmin Park; Dongchan Shin; Jihwa Lee

arXiv:2210.09474·cs.CL·October 19, 2022

Leveraging Non-dialogue Summaries for Dialogue Summarization

Seongmin Park, Dongchan Shin, Jihwa Lee

PDF

Open Access

TL;DR

This paper proposes a method to improve dialogue summarization by leveraging non-dialogue summarization datasets through data transformation techniques, demonstrating significant benefits especially in low-resource scenarios.

Contribution

It introduces data transformation methods to adapt non-dialogue datasets for dialogue summarization, enhancing performance and faithfulness in low-resource settings.

Findings

01

Significant performance gains in zero- and few-shot settings.

02

Improved faithfulness to source text across training regimes.

03

Effective use of non-dialogue data for dialogue summarization.

Abstract

To mitigate the lack of diverse dialogue summarization datasets in academia, we present methods to utilize non-dialogue summarization data for enhancing dialogue summarization systems. We apply transformations to document summarization data pairs to create training data that better befit dialogue summarization. The suggested transformations also retain desirable properties of non-dialogue datasets, such as improved faithfulness to the source text. We conduct extensive experiments across both English and Korean to verify our approach. Although absolute gains in ROUGE naturally plateau as more dialogue summarization samples are introduced, utilizing non-dialogue data for training significantly improves summarization performance in zero- and few-shot settings and enhances faithfulness across all training regimes.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques