Influence-Preserving Proxies for Gradient-Based Data Selection in LLM Fine-tuning

Sirui Chen; Yunzhe Qi; Mengting Ai; Yifan Sun; Ruizhong Qiu; Jiaru Zou; Jingrui He

arXiv:2602.17835·cs.LG·February 23, 2026

Influence-Preserving Proxies for Gradient-Based Data Selection in LLM Fine-tuning

Sirui Chen, Yunzhe Qi, Mengting Ai, Yifan Sun, Ruizhong Qiu, Jiaru Zou, Jingrui He

PDF

Open Access

TL;DR

This paper introduces Iprox, a framework for creating influence-preserving proxies directly from target LLMs, enabling scalable gradient-based data selection that outperforms existing proxy methods in efficiency and effectiveness.

Contribution

Iprox is a novel two-stage method that constructs influence-preserving proxies from the target model, improving scalability and performance in gradient-based data selection for LLM fine-tuning.

Findings

01

Iprox outperforms off-the-shelf proxies across diverse LLMs and tasks.

02

A 1.5B proxy with Iprox surpasses a larger 1.7B off-the-shelf proxy.

03

Iprox reduces computational cost by over 50% while maintaining superior performance.

Abstract

Supervised fine-tuning (SFT) relies critically on selecting training data that most benefits a model's downstream performance. Gradient-based data selection methods such as TracIn and Influence Functions leverage influence to identify useful samples, but their computational cost scales poorly, making them impractical for multi-billion-parameter large language models (LLMs). A common alternative is to use off-the-shelf smaller models as proxies, but they remain suboptimal since their learning dynamics are unclear, their sizes cannot be flexibly adjusted, and they cannot be further aligned with the target model in terms of gradient-based influence estimation. To address these challenges, we introduce Iprox, a two-stage framework that derives influence-preserving proxies directly from the target model. It first applies a low-rank compression stage to preserve influence information of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis