Zero-Shot Strategies for Length-Controllable Summarization

Fabian Retkowski; Alexander Waibel

arXiv:2501.00233·cs.CL·February 12, 2025

Zero-Shot Strategies for Length-Controllable Summarization

Fabian Retkowski, Alexander Waibel

PDF

Open Access 1 Datasets 1 Video

TL;DR

This paper investigates the challenges of zero-shot length control in large language models for summarization, evaluates their performance, and proposes practical methods to improve length adherence without fine-tuning.

Contribution

It introduces a set of methods—length approximation, target adjustment, sample filtering, and automated revisions—that significantly enhance zero-shot length control in LLM summarization.

Findings

01

Substantial improvements in length compliance achieved

02

Methods maintain or improve summary quality

03

Reveals biases and variability in LLM length control

Abstract

Large language models (LLMs) struggle with precise length control, particularly in zero-shot settings. We conduct a comprehensive study evaluating LLMs' length control capabilities across multiple measures and propose practical methods to improve controllability. Our experiments with LLaMA 3 reveal stark differences in length adherence across measures and highlight inherent biases of the model. To address these challenges, we introduce a set of methods: length approximation, target adjustment, sample filtering, and automated revisions. By combining these methods, we demonstrate substantial improvements in length compliance while maintaining or enhancing summary quality, providing highly effective zero-shot strategies for precise length control without the need for model fine-tuning or architectural changes. With our work, we not only advance our understanding of LLM behavior in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

retkowski/length-controllability-evaluation
dataset· 5 dl
5 dl

Videos

Zero-Shot Strategies for Length-Controllable Summarization· underline

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Algorithms and Data Compression

MethodsSparse Evolutionary Training · LLaMA