DialogLM: Pre-trained Model for Long Dialogue Understanding and   Summarization

Ming Zhong; Yang Liu; Yichong Xu; Chenguang Zhu; Michael Zeng

arXiv:2109.02492·cs.CL·January 7, 2022

DialogLM: Pre-trained Model for Long Dialogue Understanding and Summarization

Ming Zhong, Yang Liu, Yichong Xu, Chenguang Zhu, Michael Zeng

PDF

Open Access 1 Repo 2 Models 1 Video

TL;DR

DialogLM is a pre-trained model designed for understanding and summarizing long multi-person dialogues, employing a window-based denoising approach and hybrid sparse attention to outperform existing models across various dialogue tasks.

Contribution

The paper introduces a novel pre-training framework with window-based denoising and hybrid sparse attention for long dialogue understanding and summarization.

Findings

01

Significantly outperforms state-of-the-art models on multiple dialogue datasets.

02

Effective in tasks like dialogue summarization, question answering, and topic segmentation.

03

Demonstrates robustness on long, multi-person conversations.

Abstract

Dialogue is an essential part of human communication and cooperation. Existing research mainly focuses on short dialogue scenarios in a one-on-one fashion. However, multi-person interactions in the real world, such as meetings or interviews, are frequently over a few thousand words. There is still a lack of corresponding research and powerful tools to understand and process such long dialogues. Therefore, in this work, we present a pre-training framework for long dialogue understanding and summarization. Considering the nature of long conversations, we propose a window-based denoising approach for generative pre-training. For a dialogue, it corrupts a window of text with dialogue-inspired noise, and guides the model to reconstruct this window based on the content of the remaining conversation. Furthermore, to process longer input, we augment the model with sparse attention which is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

microsoft/dialoglm
pytorchOfficial

Models

Videos

DialogLM: Pre-Trained Model for Long Dialogue Understanding and Summarization· underline

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Multimodal Machine Learning Applications