DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models

Hao Liang; Zhengyang Zhao; Meiyi Qiang; Mingrui Chen; Lu Ma; Rongyi Yu; Hengyi Feng; Shixuan Sun; Zimo Meng; Xiaochen Ma; Xuanlin Yang; Qifeng Cai; Ruichuan An; Bohan Zeng; Zhen Hao Wong; Chengyu Shen; Runming He; Zhaoyang Han; Yaowei Zheng; Fangcheng Fu; Conghui He; Bin Cui; Zhiyu Li; Weinan E; Wentao Zhang

arXiv:2603.26164·cs.LG·March 30, 2026

DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models

Hao Liang, Zhengyang Zhao, Meiyi Qiang, Mingrui Chen, Lu Ma, Rongyi Yu, Hengyi Feng, Shixuan Sun, Zimo Meng, Xiaochen Ma, Xuanlin Yang, Qifeng Cai, Ruichuan An, Bohan Zeng, Zhen Hao Wong, Chengyu Shen, Runming He, Zhaoyang Han, Yaowei Zheng, Fangcheng Fu, Conghui He, Bin Cui

PDF

1 Repo

TL;DR

DataFlex is a unified, flexible framework that enhances large language model training by integrating data selection, mixture adjustment, and reweighting, leading to improved performance and efficiency.

Contribution

It introduces a comprehensive, modular framework for data-centric dynamic training of LLMs, supporting multiple paradigms with compatibility and scalability.

Findings

01

Dynamic data selection improves MMLU performance across models.

02

Data mixture methods like DoReMi and ODM enhance accuracy and perplexity.

03

DataFlex achieves runtime improvements over existing methods.

Abstract

Data-centric training has emerged as a promising direction for improving large language models (LLMs) by optimizing not only model parameters but also the selection, composition, and weighting of training data during optimization. However, existing approaches to data selection, data mixture optimization, and data reweighting are often developed in isolated codebases with inconsistent interfaces, hindering reproducibility, fair comparison, and practical integration. In this paper, we present DataFlex, a unified data-centric dynamic training framework built upon LLaMA-Factory. DataFlex supports three major paradigms of dynamic data optimization: sample selection, domain mixture adjustment, and sample reweighting, while remaining fully compatible with the original training workflow. It provides extensible trainer abstractions and modular components, enabling a drop-in replacement for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

opendcai/DataFlex
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.