Factual Dialogue Summarization via Learning from Large Language Models

Rongxin Zhu; Jey Han Lau; Jianzhong Qi

arXiv:2406.14709·cs.CL·June 24, 2024·1 cites

Factual Dialogue Summarization via Learning from Large Language Models

Rongxin Zhu, Jey Han Lau, Jianzhong Qi

PDF

Open Access

TL;DR

This paper proposes a symbolic knowledge distillation method using contrastive learning to improve factual consistency in dialogue summarization models, especially smaller ones, by leveraging large language models.

Contribution

It introduces a novel approach combining zero-shot knowledge extraction and contrastive learning to enhance factual accuracy in smaller dialogue summarization models.

Findings

01

Outperforms strong baselines in factual consistency.

02

Maintains coherence, fluency, and relevance.

03

Effective across multiple pretrained models.

Abstract

Factual consistency is an important quality in dialogue summarization. Large language model (LLM)-based automatic text summarization models generate more factually consistent summaries compared to those by smaller pretrained language models, but they face deployment challenges in real-world applications due to privacy or resource constraints. In this paper, we investigate the use of symbolic knowledge distillation to improve the factual consistency of smaller pretrained models for dialogue summarization. We employ zero-shot learning to extract symbolic knowledge from LLMs, generating both factually consistent (positive) and inconsistent (negative) summaries. We then apply two contrastive learning objectives on these summaries to enhance smaller summarization models. Experiments with BART, PEGASUS, and Flan-T5 indicate that our approach surpasses strong baselines that rely on complex…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Data Quality and Management · Natural Language Processing Techniques

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Residual Connection · Softmax · PEGASUS · Layer Normalization · Byte Pair Encoding · Dropout · Adam · Linear Layer