Preference-aware Influence-function-based Data Selection Method for Efficient Fine-Tuning

Qihao Lin; Guanxu Chen; Dongrui Liu; Jing Shao

arXiv:2605.21422·cs.LG·May 21, 2026

Preference-aware Influence-function-based Data Selection Method for Efficient Fine-Tuning

Qihao Lin, Guanxu Chen, Dongrui Liu, Jing Shao

PDF

TL;DR

PRISM is a data selection method that uses model preferences to weight target examples and improve fine-tuning efficiency for large language models.

Contribution

It introduces a preference-aware influence function approach that enhances data selection by considering the relevance of target examples to the current model.

Findings

01

PRISM improves fine-tuning efficiency across various model scales.

02

It enhances safety-oriented supervised fine-tuning repair.

03

Preference weighting leads to more effective target-behavior guidance.

Abstract

As LLMs continue to scale, improving training efficiency increasingly depends on using data more effectively. Data selection addresses this problem by allocating a limited training budget to samples that best promote a target behavior. Existing methods usually represent the target behavior with a set of target examples, but often treat these examples as equally important. This can be inefficient because target examples may differ in their relevance to the current model: examples closer to the model's current behavior provide more actionable guidance than those farther away. We propose PRISM (PReference-aware Influence-function-based Data Selection Method for Efficient Fine-Tuning), which uses the current model's preference to weight target examples and construct a preference-aware target representation. PRISM then scores candidate training samples by their alignment with this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.