CONFIT: Toward Faithful Dialogue Summarization with Linguistically-Informed Contrastive Fine-tuning
Xiangru Tang, Arjun Nair, Borui Wang, Bingyao Wang, Jai Desai, Aaron, Wade, Haoran Li, Asli Celikyilmaz, Yashar Mehdad, Dragomir Radev

TL;DR
This paper introduces ConFiT, a contrastive fine-tuning approach that leverages linguistic error typologies to improve factual consistency in dialogue summarization, significantly reducing errors and enhancing summary quality.
Contribution
It proposes a novel contrastive fine-tuning method using error-specific objectives based on linguistic error typology to improve factual accuracy in dialogue summarization.
Findings
Reduces factual errors in dialogue summaries
Improves automatic faithfulness metrics and human evaluation scores
Generalizes well to meeting summarization datasets
Abstract
Factual inconsistencies in generated summaries severely limit the practical applications of abstractive dialogue summarization. Although significant progress has been achieved by using pre-trained models, substantial amounts of hallucinated content are found during the human evaluation. Pre-trained models are most commonly fine-tuned with cross-entropy loss for text summarization, which may not be an optimal strategy. In this work, we provide a typology of factual errors with annotation data to highlight the types of errors and move away from a binary understanding of factuality. We further propose a training strategy that improves the factual consistency and overall quality of summaries via a novel contrastive fine-tuning, called ConFiT. Based on our linguistically-informed typology of errors, we design different modular objectives that each target a specific type. Specifically, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems
