Chain of Summaries: Summarization Through Iterative Questioning
William Brach, Kristi\'an Ko\v{s}\v{t}\'al, Lukas Galke Poech

TL;DR
The paper introduces Chain of Summaries (CoS), an iterative summarization method inspired by dialectical reasoning, which produces dense, general-purpose summaries that improve LLM performance on various datasets and are more accessible for web content use.
Contribution
It presents a novel iterative summarization approach, CoS, that refines summaries through questioning, outperforming existing methods and enhancing LLM understanding and retrieval efficiency.
Findings
CoS outperforms zero-shot LLM baselines by up to 66%.
CoS surpasses specialized summarization methods by up to 27%.
Summaries generated by CoS improve Q&A performance and reduce token usage.
Abstract
Large Language Models (LLMs) are increasingly using external web content. However, much of this content is not easily digestible by LLMs due to LLM-unfriendly formats and limitations of context length. To address this issue, we propose a method for generating general-purpose, information-dense summaries that act as plain-text repositories of web content. Inspired by Hegel's dialectical method, our approach, denoted as Chain of Summaries (CoS), iteratively refines an initial summary (thesis) by identifying its limitations through questioning (antithesis), leading to a general-purpose summary (synthesis) that can satisfy current and anticipate future information needs. Experiments on the TriviaQA, TruthfulQA, and SQUAD datasets demonstrate that CoS outperforms zero-shot LLM baselines by up to 66\% and specialized summarization methods such as Chain of Density, BRIO and PEGASUS by up to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Computational and Text Analysis Methods · Text and Document Classification Technologies
