Beyond Single-Task: Robust Multi-Task Length Generalization for LLMs
Yi Hu, Shijia Kang, Haotong Yang, Haotian Xu, Muhan Zhang

TL;DR
This paper introduces Meta-RFFT, a framework that enables large language models to generalize length understanding across multiple tasks, significantly improving performance on unseen tasks with minimal fine-tuning.
Contribution
It presents Meta-RFFT, the first method for robust multi-task length generalization, and constructs a large diverse dataset for evaluating this capability.
Findings
Meta-RFFT achieves high accuracy on unseen tasks with minimal fine-tuning.
Models outperform state-of-the-art reasoning models on long addition tasks.
Constructed a dataset with 86 diverse tasks for length generalization evaluation.
Abstract
Length generalization, the ability to solve problems longer than those seen during training, remains a critical challenge for large language models (LLMs). Previous work modifies positional encodings (PEs) and data formats to improve length generalization on specific symbolic tasks such as addition and sorting. However, these approaches are fundamentally limited to special tasks, often degrading general language performance. Furthermore, they are typically evaluated on small transformers trained from scratch on single tasks and can cause performance drop when applied during post-training stage of practical LLMs with general capabilities. Hu et al., (2024) proposed Rule-Following Fine-Tuning (RFFT) to improve length generalization in the post-training stage of LLMs. Despite its compatibility with practical models and strong performance, RFFT is proposed for single tasks too, requiring…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
