StrucSum: Graph-Structured Reasoning for Long Document Extractive Summarization with LLMs
Haohan Yuan, Sukhwa Hong, Haopeng Zhang

TL;DR
StrucSum is a prompting framework that enhances large language models' ability to perform extractive summarization of long documents by incorporating graph-structured reasoning, leading to improved summary quality and factual consistency.
Contribution
Introduces StrucSum, a training-free prompting method that integrates document structure into LLM reasoning using graph-based strategies for better long document summarization.
Findings
Significantly improves summary quality and factual consistency on multiple datasets.
Outperforms unsupervised baselines and vanilla prompting methods.
Demonstrates the effectiveness of structure-aware prompting with graph signals.
Abstract
Large language models (LLMs) have shown strong performance in zero-shot summarization, but often struggle to model document structure and identify salient information in long texts. In this work, we introduce StrucSum, a training-free prompting framework that enhances LLM reasoning through sentence-level graph structures. StrucSum injects structural signals into prompts via three targeted strategies: Neighbor-Aware Prompting (NAP) for local context, Centrality-Aware Prompting (CAP) for importance estimation, and Centrality-Guided Masking (CGM) for efficient input reduction. Experiments on ArXiv, PubMed, and Multi-News demonstrate that StrucSum consistently improves both summary quality and factual consistency over unsupervised baselines and vanilla prompting. In particular, on ArXiv, it increases FactCC and SummaC by 19.2\% and 8.0\% points, demonstrating stronger alignment between…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTopic Modeling · Text Readability and Simplification · Biomedical Text Mining and Ontologies
