Beyond What to Select: A Plug-and-play Oscillatory Data-Volume Scheduling for Efficient Model Training

Suorong Yang; Hanqi Zhu; Hai Gan; Fangjian Su; Guang Li; Furao Shen; Soujanya Poria

arXiv:2605.14773 · cs.LG · May 15, 2026

TL;DR

This paper introduces PODS, a dynamic data-volume scheduling framework that alternates data selection ratios during training to improve efficiency and generalization across multiple tasks.

Contribution

PODS is a lightweight, task-agnostic module that dynamically schedules data volume, enhancing training efficiency without sacrificing model performance.

Findings

01 Reduces ImageNet-1k training cost by 50%.

02 Speeds up LLM instruction tuning by over 2x.

03 Improves the efficiency-generalization trade-off across tasks.

Abstract

Data selection accelerates training by identifying representative training data while preserving model performance. However, existing methods mainly focus on designing sample-importance criteria, i.e., deciding what to select, while typically fixing the selected data volume as the target ratio throughout training. Thus, they are often dynamic in sample identity but static in data volume. In this work, we revisit data selection from an optimization perspective and show that selected-data training induces an implicit regularization effect modulated by the instantaneous selection ratio. This reveals a key trade-off: lower ratios amplify selection-induced regularization, whereas higher ratios preserve data coverage and optimization fidelity. Motivated by this insight, we propose PODS, a Plug-and-play Oscillatory Data-volume Scheduling framework. Rather than introducing another…
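To make the idea of an oscillating selection ratio concrete, here is a minimal sketch in Python. It is not the authors' schedule, which this excerpt does not specify. The cosine waveform, the `amplitude` and `cycles` parameters, and the top-k selection by a per-sample score are all assumptions chosen for illustration. The sketch only shows how a per-epoch ratio can swing above and below a target while averaging to that target over whole cycles.

```python
import math


def oscillating_ratio(epoch, total_epochs, target=0.5, amplitude=0.2, cycles=4):
    """Hypothetical schedule: cosine oscillation around the target ratio.

    Low-ratio epochs would correspond to stronger selection-induced
    regularization, high-ratio epochs to broader data coverage.
    """
    phase = 2 * math.pi * cycles * epoch / total_epochs
    ratio = target + amplitude * math.cos(phase)
    # Clamp so the ratio is always a valid fraction of the dataset.
    return min(1.0, max(0.0, ratio))


def select_indices(scores, ratio):
    """Keep the top `ratio` fraction of samples by score (assumed criterion)."""
    k = max(1, round(ratio * len(scores)))
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return order[:k]


if __name__ == "__main__":
    total_epochs = 40
    ratios = [oscillating_ratio(e, total_epochs) for e in range(total_epochs)]
    # Over whole cycles the average ratio matches the target budget.
    print(f"mean ratio: {sum(ratios) / total_epochs:.3f}")
    print(f"min/max ratio: {min(ratios):.2f} / {max(ratios):.2f}")
```

Any existing importance criterion could stand in for `scores`. The schedule changes only how many samples are kept in each epoch, not how they are ranked.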
