SPaCe: Unlocking Sample-Efficient Large Language Models Training With Self-Pace Curriculum Learning

Dai Do; Manh Nguyen; Svetha Venkatesh; Hung Le

arXiv:2508.05015·cs.LG·April 17, 2026

SPaCe: Unlocking Sample-Efficient Large Language Models Training With Self-Pace Curriculum Learning

Dai Do, Manh Nguyen, Svetha Venkatesh, Hung Le

PDF

TL;DR

SPaCe is a self-paced curriculum learning framework that improves large language model training efficiency by adaptively selecting data based on difficulty and model capability, reducing data needs significantly.

Contribution

The paper introduces SPaCe, a novel data reduction and adaptive sampling method that enhances training efficiency and performance of LLMs with fewer samples.

Findings

01

SPaCe achieves comparable or better accuracy than state-of-the-art methods.

02

It reduces training data requirements by up to 100 times.

03

Data clustering and adaptive selection are crucial for effectiveness.

Abstract

Large language models (LLMs) have shown strong reasoning capabilities when fine-tuned with reinforcement learning (RL). However, such methods require extensive data and compute, making them impractical under many realistic training budgets. Many existing pipelines sample training examples uniformly across steps or epochs, ignoring differences in difficulty, redundancy, and learning value, which slows learning and wastes computation. We propose \textbf{SPaCe}, a self-paced learning framework that enables efficient learning based on the capability of the model being trained through optimizing which data to use and when. First, we apply \emph{cluster-based data reduction} to partition training data by semantics and difficulty, extracting a compact yet diverse subset that reduces redundancy. Then, a \textit{multi-armed bandit} treats data clusters as arms, allocating training samples based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.