Beyond Single-Task: Robust Multi-Task Length Generalization for LLMs

Yi Hu; Shijia Kang; Haotong Yang; Haotian Xu; Muhan Zhang

arXiv:2502.11525·cs.CL·May 20, 2025

Beyond Single-Task: Robust Multi-Task Length Generalization for LLMs

Yi Hu, Shijia Kang, Haotong Yang, Haotian Xu, Muhan Zhang

PDF

Open Access

TL;DR

This paper introduces Meta-RFFT, a framework that enables large language models to generalize length understanding across multiple tasks, significantly improving performance on unseen tasks with minimal fine-tuning.

Contribution

It presents Meta-RFFT, the first method for robust multi-task length generalization, and constructs a large diverse dataset for evaluating this capability.

Findings

01

Meta-RFFT achieves high accuracy on unseen tasks with minimal fine-tuning.

02

Models outperform state-of-the-art reasoning models on long addition tasks.

03

Constructed a dataset with 86 diverse tasks for length generalization evaluation.

Abstract

Length generalization, the ability to solve problems longer than those seen during training, remains a critical challenge for large language models (LLMs). Previous work modifies positional encodings (PEs) and data formats to improve length generalization on specific symbolic tasks such as addition and sorting. However, these approaches are fundamentally limited to special tasks, often degrading general language performance. Furthermore, they are typically evaluated on small transformers trained from scratch on single tasks and can cause performance drop when applied during post-training stage of practical LLMs with general capabilities. Hu et al., (2024) proposed Rule-Following Fine-Tuning (RFFT) to improve length generalization in the post-training stage of LLMs. Despite its compatibility with practical models and strong performance, RFFT is proposed for single tasks too, requiring…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques