GLIMMER: Incorporating Graph and Lexical Features in Unsupervised Multi-Document Summarization
Ran Liu, Ming Liu, Min Yu, Jianguo Jiang, Gang Li, Dan Zhang, Jingyuan, Li, Xiang Meng, Weiqing Huang

TL;DR
GLIMMER is an unsupervised multi-document summarization method that leverages graph and lexical features to produce more coherent and informative summaries, outperforming existing models in zero-shot settings and human evaluations.
Contribution
It introduces a novel unsupervised approach combining graph and lexical features for multi-document summarization, avoiding reliance on large pre-trained models.
Findings
Outperforms existing unsupervised methods on multiple datasets.
Surpasses state-of-the-art pre-trained models in zero-shot ROUGE scores.
Achieves high human-rated readability and informativeness.
Abstract
Pre-trained language models are increasingly being used in multi-document summarization tasks. However, these models need large-scale corpora for pre-training and are domain-dependent. Other non-neural unsupervised summarization approaches mostly rely on key sentence extraction, which can lead to information loss. To address these challenges, we propose a lightweight yet effective unsupervised approach called GLIMMER: a Graph and LexIcal features based unsupervised Multi-docuMEnt summaRization approach. It first constructs a sentence graph from the source documents, then automatically identifies semantic clusters by mining low-level features from raw texts, thereby improving intra-cluster correlation and the fluency of generated sentences. Finally, it summarizes clusters into natural sentences. Experiments conducted on Multi-News, Multi-XScience and DUC-2004 demonstrate that our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Text Analysis Techniques · Natural Language Processing Techniques
MethodsPEGASUS
