Compressed Heterogeneous Graph for Abstractive Multi-Document Summarization
Miao Li, Jianzhong Qi, Jey Han Lau

TL;DR
HGSUM introduces a heterogeneous graph-based encoder-decoder model for multi-document summarization that effectively captures diverse semantic relationships and uses graph compression to improve summary quality, outperforming existing models.
Contribution
The paper presents HGSUM, a novel MDS model that incorporates heterogeneous graphs and graph pooling for better relationship modeling and summary generation.
Findings
HGSUM outperforms state-of-the-art MDS models on multiple datasets.
Graph compression guided by similarity improves summary relevance.
End-to-end training with graph objectives enhances model performance.
Abstract
Multi-document summarization (MDS) aims to generate a summary for a number of related documents. We propose HGSUM, an MDS model that extends an encoder-decoder architecture, to incorporate a heterogeneous graph to represent different semantic units (e.g., words and sentences) of the documents. This contrasts with existing MDS models which do not consider different edge types of graphs and as such do not capture the diversity of relationships in the documents. To preserve only key information and relationships of the documents in the heterogeneous graph, HGSUM uses graph pooling to compress the input graph. And to guide HGSUM to learn compression, we introduce an additional objective that maximizes the similarity between the compressed graph and the graph constructed from the ground-truth summary during training. HGSUM is trained end-to-end with graph similarity and standard…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
