P3: A Policy-Driven, Pace-Adaptive, and Diversity-Promoted Framework for data pruning in LLM Training
Yingxuan Yang, Huayi Wang, Muning Wen, Xiaoyun Mo, Qiuying Peng, Jun, Wang, Weinan Zhang

TL;DR
P3 is an adaptive data pruning framework for LLM fine-tuning that dynamically assesses data difficulty, progressively introduces challenging data, and promotes diversity to enhance model performance.
Contribution
It introduces a novel, policy-driven, pace-adaptive, and diversity-promoted framework for data pruning in LLM training, combining real-time difficulty assessment, self-paced learning, and DPP-based diversity.
Findings
Significant performance improvements on reasoning tasks.
Effective dynamic data selection enhances fine-tuning.
Outperforms traditional data pruning methods.
Abstract
In the rapidly advancing field of Large Language Models (LLMs), effectively leveraging existing datasets during fine-tuning to maximize the model's potential is of paramount importance. This paper introduces P3, an adaptive framework aimed at optimizing the task-specific fine-tuning process through iterative data pruning. P3 consists of three key components: (1) Policy-driven Difficulty Measurement, which dynamically assesses data difficulty based on the model's real-time performance, replacing static metrics with adaptable evaluations; (2) Pace-Adaptive Selection, leveraging self-paced learning to progressively introduce more challenging data, thereby enhancing model capability; (3) Diversity Promotion, incorporating Determinantal Point Process (DPP) to ensure data diversity across epochs, enriching the learning process. We validate P3 on the reasoning scenarios, APPS and MATH,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWikis in Education and Collaboration
MethodsPruning
