Disentangling Specificity for Abstractive Multi-document Summarization

Congbo Ma; Wei Emma Zhang; Hu Wang; Haojie Zhuang; Mingyu Guo

arXiv:2406.00005·cs.IR·June 4, 2024

TL;DR

This paper introduces a method to disentangle document-specific content from shared information in multi-document summarization, enhancing summary comprehensiveness by explicitly modeling unique document details.

Contribution

It proposes a novel disentanglement approach with an orthogonal constraint to better capture document-specific information in MDS.

Findings

01

Disentangling specific content improves summary quality.

02

Shared information contributes less to MDS performance.

03

Combining specific and shared representations yields more comprehensive summaries.

Abstract

Multi-document summarization (MDS) generates a summary from a document set. Each document in a set describes topic-relevant concepts, while each document also has its own unique content. However, document specificity has received little attention from existing MDS approaches. Neglecting the information specific to each document limits the comprehensiveness of the generated summaries. To solve this problem, we propose to disentangle the specific content from the documents in a document set. The document-specific representations, which are encouraged to be distant from each other via a proposed orthogonal constraint, are learned by the specific representation learner. We provide extensive analysis and find that specific information and document set representations contribute distinctive strengths and their combination yields a more comprehensive solution for the…
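
To make the idea of an orthogonal constraint concrete, here is a minimal sketch of one common way to push a set of vectors apart: penalize the squared cosine similarity between every pair. This is an illustrative assumption, not the paper's formulation, which this page does not give; the function names `cosine` and `orthogonality_penalty` and the toy vectors are hypothetical.

```python
import math
from itertools import combinations


def cosine(u, v):
    """Cosine similarity between two equal-length, nonzero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def orthogonality_penalty(specific_reps):
    """Mean squared cosine similarity over all pairs of vectors.

    Hypothetical penalty (an assumption, not the paper's loss):
    0.0 when every pair is orthogonal, 1.0 when every pair is parallel.
    """
    pairs = list(combinations(specific_reps, 2))
    if not pairs:
        # A single document has nothing to be pushed away from.
        return 0.0
    return sum(cosine(u, v) ** 2 for u, v in pairs) / len(pairs)


# Toy document-specific vectors, one per document in the set.
orthogonal = [[1.0, 0.0, 0.0], [0.0, 2.0, 0.0], [0.0, 0.0, 3.0]]
overlapping = [[1.0, 0.0], [1.0, 0.0], [2.0, 0.0]]

print(orthogonality_penalty(orthogonal))
print(orthogonality_penalty(overlapping))
```

Under this assumed penalty, minimizing the value during training would drive the per-document vectors toward mutually orthogonal directions, which is one reading of "encouraged to be distant from each other".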

Code & Models

Repositories

congboma/disentanglesum (PyTorch, official)

Taxonomy

Topics: Advanced Text Analysis Techniques · Topic Modeling · Web Data Mining and Analysis

Methods: Sparse Evolutionary Training