TANet: Thread-Aware Pretraining for Abstractive Conversational   Summarization

Ze Yang; Liran Wang; Zhoujin Tian; Wei Wu; Zhoujun Li

arXiv:2204.04504·cs.CL·April 12, 2022

TANet: Thread-Aware Pretraining for Abstractive Conversational Summarization

Ze Yang, Liran Wang, Zhoujin Tian, Wei Wu, Zhoujun Li

PDF

Open Access

TL;DR

This paper introduces TANet, a thread-aware Transformer model trained on a large Reddit dataset, which effectively captures conversational structure for abstractive summarization, outperforming previous models on multiple datasets.

Contribution

The paper presents a novel thread-aware pretraining approach that incorporates structural conversation information into Transformer models for better summarization.

Findings

01

TANet achieves state-of-the-art results on four real-world datasets.

02

Incorporating structural information improves summarization quality.

03

The large Reddit dataset (RCS) enhances pretraining effectiveness.

Abstract

Although pre-trained language models (PLMs) have achieved great success and become a milestone in NLP, abstractive conversational summarization remains a challenging but less studied task. The difficulty lies in two aspects. One is the lack of large-scale conversational summary data. Another is that applying the existing pre-trained models to this task is tricky because of the structural dependence within the conversation and its informal expression, etc. In this work, we first build a large-scale (11M) pretraining dataset called RCS, based on the multi-person discussions in the Reddit community. We then present TANet, a thread-aware Transformer-based network. Unlike the existing pre-trained models that treat a conversation as a sequence of sentences, we argue that the inherent contextual dependency among the utterances plays an essential role in understanding the entire conversation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques