Mutual Reinforcement of LLM Dialogue Synthesis and Summarization Capabilities for Few-Shot Dialogue Summarization

Yen-Ju Lu; Ting-Yao Hu; Hema Swetha Koppula; Hadi Pouransari; Jen-Hao Rick Chang; Yin Xia; Xiang Kong; Qi Zhu; Simon Wang; Oncel Tuzel; Raviteja Vemulapalli

arXiv:2502.17328·cs.CL·February 25, 2025

TL;DR

This paper introduces a data synthesis method for few-shot dialogue summarization in which a large language model's dialogue generation and summarization capabilities reinforce each other, yielding higher automatic metric scores and better human evaluation results.

Contribution

The paper presents a novel mutual reinforcement mechanism that leverages internal LLM knowledge to generate synthetic data, improving dialogue summarization without external resources.

Findings

01 Achieved 1.5% higher ROUGE scores in few-shot settings.

02 Improved BERT scores by 0.3% in experiments.

03 Outperformed baselines in human evaluations.

Abstract

In this work, we propose Mutual Reinforcing Data Synthesis (MRDS) within LLMs to improve the few-shot dialogue summarization task. Unlike prior methods that require external knowledge, we mutually reinforce the LLM's dialogue synthesis and summarization capabilities, allowing them to complement each other during training and enhance overall performance. The dialogue synthesis capability is enhanced by directed preference optimization with preference scoring from the summarization capability. The summarization capability is enhanced by the additional high-quality dialogue-summary paired data produced by the dialogue synthesis capability. By leveraging the proposed MRDS mechanism, we elicit the internal knowledge of the LLM in the form of synthetic data and use it to augment the few-shot real training dataset. Empirical results demonstrate that our method improves dialogue summarization,…
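As a rough illustration of the loop the abstract describes, the sketch below ranks candidate synthetic dialogues for one summary with a scoring function, turns the best and worst into a preference pair, and evaluates a preference loss on that pair. Everything here is an assumption made for illustration: `toy_summary_score` is a hypothetical word-overlap stand-in for the summarizer-based preference score, `build_preference_pair` and `dpo_loss` are illustrative names, and the loss assumes the abstract's "directed preference optimization" corresponds to the standard DPO objective. It is not the authors' implementation.

```python
import math
from typing import Callable, List, Tuple


def toy_summary_score(dialogue: str, summary: str) -> float:
    """Hypothetical stand-in for the summarizer's preference score.

    Returns the fraction of summary words that also occur in the dialogue.
    A real system would presumably score with the LLM itself (assumption).
    """
    summary_words = summary.lower().split()
    dialogue_words = set(dialogue.lower().split())
    if not summary_words:
        return 0.0
    hits = sum(1 for word in summary_words if word in dialogue_words)
    return hits / len(summary_words)


def build_preference_pair(
    candidates: List[str],
    summary: str,
    score_fn: Callable[[str, str], float],
) -> Tuple[str, str]:
    """Return (chosen, rejected): the highest- and lowest-scoring dialogues."""
    ranked = sorted(candidates, key=lambda d: score_fn(d, summary))
    return ranked[-1], ranked[0]


def dpo_loss(
    logp_chosen: float,
    logp_rejected: float,
    ref_logp_chosen: float,
    ref_logp_rejected: float,
    beta: float = 0.1,
) -> float:
    """Standard DPO loss for one pair: -log(sigmoid(margin)).

    The margin is beta times the difference between the policy-vs-reference
    log-ratio of the chosen sample and that of the rejected sample.
    """
    margin = beta * (
        (logp_chosen - ref_logp_chosen) - (logp_rejected - ref_logp_rejected)
    )
    # -log(sigmoid(m)) equals log(1 + exp(-m)).
    return math.log1p(math.exp(-margin))


# Toy usage: two candidate dialogues synthesized for the same summary.
summary = "Anna will bring dessert to the party"
candidates = [
    "Anna: I will bring dessert to the party. Ben: Great!",
    "Anna: See you later. Ben: Bye!",
]
chosen, rejected = build_preference_pair(candidates, summary, toy_summary_score)

# Made-up log-probabilities, only to show how the loss is evaluated.
loss = dpo_loss(-1.0, -2.0, -1.5, -1.5, beta=0.1)
```

In this toy run the first candidate is chosen because it shares more words with the summary. In the setting the abstract describes, the chosen dialogues would also be paired with their summaries and added to the few-shot training set for the summarizer.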

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Speech and dialogue systems

Methods: Attention Is All You Need · Adam · Softmax · Dropout · Weight Decay · Linear Layer · Layer Normalization · WordPiece · Dense Connections