Automatic Summarization of Long Documents

Naman Chhibbar; Jugal Kalita

arXiv:2410.05903·cs.CL·October 10, 2024

Automatic Summarization of Long Documents

Naman Chhibbar, Jugal Kalita

PDF

Open Access 1 Repo

TL;DR

This paper presents three innovative algorithms enabling large language models to process extremely long documents exceeding 70,000 words, significantly improving summarization quality without changing the model architecture.

Contribution

The study introduces novel algorithms that overcome input size limitations of transformer models, allowing effective summarization of very long texts without architectural modifications.

Findings

01

Significant increase in BERTScore on long documents

02

Competitive ROUGE scores achieved

03

Algorithms enable processing of texts over 70,000 words

Abstract

A vast amount of textual data is added to the internet daily, making utilization and interpretation of such data difficult and cumbersome. As a result, automatic text summarization is crucial for extracting relevant information, saving precious reading time. Although many transformer-based models excel in summarization, they are constrained by their input size, preventing them from processing texts longer than their context size. This study introduces three novel algorithms that allow any LLM to efficiently overcome its input size limitation, effectively utilizing its full potential without any architectural modifications. We test our algorithms on texts with more than 70,000 words, and our experiments show a significant increase in BERTScore with competitive ROUGE scores.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

NamanChhibbar/Long-Document-Summarizer
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Quality and Management · Topic Modeling · Data Mining Algorithms and Applications