HERA: Improving Long Document Summarization using Large Language Models   with Context Packaging and Reordering

Taiji Li; Hao Chen; Fei Yu; Yin Zhang

arXiv:2502.00448·cs.CL·February 4, 2025

HERA: Improving Long Document Summarization using Large Language Models with Context Packaging and Reordering

Taiji Li, Hao Chen, Fei Yu, Yin Zhang

PDF

Open Access

TL;DR

HERA is a framework that enhances long document summarization by segmenting, retrieving, and reordering content to improve understanding and relevance in large language models without extra fine-tuning.

Contribution

HERA introduces a novel method of semantic segmentation, event-based retrieval, and reordering to improve long document summarization with LLMs.

Findings

01

HERA outperforms baseline models in ROUGE, BERTScore, and faithfulness.

02

It does not require additional fine-tuning or resources.

03

Effective in two long document summarization datasets.

Abstract

Despite the rapid growth of context length of large language models (LLMs) , LLMs still perform poorly in long document summarization. An important reason for this is that relevant information about an event is scattered throughout long documents, and the messy narrative order impairs the accurate understanding and utilization of LLMs for long documents. To address these issues, we propose a novel summary generation framework, called HERA. Specifically, we first segment a long document by its semantic structure and retrieve text segments about the same event, and finally reorder them to form the input context. We evaluate our approach on two long document summarization datasets. The experimental results show that HERA outperforms foundation models in ROUGE, BERTScore and faithfulness metrics, while HERA does not require additional fine-tuning and resources.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Data Quality and Management