Preference-aware Influence-function-based Data Selection Method for Efficient Fine-Tuning
Qihao Lin, Guanxu Chen, Dongrui Liu, Jing Shao

TL;DR
PRISM is a data selection method that uses model preferences to weight target examples and improve fine-tuning efficiency for large language models.
Contribution
It introduces a preference-aware influence function approach that enhances data selection by considering the relevance of target examples to the current model.
Findings
PRISM improves fine-tuning efficiency across various model scales.
It enhances safety-oriented supervised fine-tuning repair.
Preference weighting leads to more effective target-behavior guidance.
Abstract
As LLMs continue to scale, improving training efficiency increasingly depends on using data more effectively. Data selection addresses this problem by allocating a limited training budget to samples that best promote a target behavior. Existing methods usually represent the target behavior with a set of target examples, but often treat these examples as equally important. This can be inefficient because target examples may differ in their relevance to the current model: examples closer to the model's current behavior provide more actionable guidance than those farther away. We propose PRISM (PReference-aware Influence-function-based Data Selection Method for Efficient Fine-Tuning), which uses the current model's preference to weight target examples and construct a preference-aware target representation. PRISM then scores candidate training samples by their alignment with this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
