From Parameters to Data: A Task-Parameter-Guided Fine-Tuning Pipeline for Efficient LLM Alignment

Hao Chen; Qi Zhang; Liyao Li; Zhanming Shen; Wentao Ye; Lirong Gao; Ningtao Wang; Xing Fu; Xiaoyu Shen; Junbo Zhao

arXiv:2605.21558·cs.LG·May 22, 2026

From Parameters to Data: A Task-Parameter-Guided Fine-Tuning Pipeline for Efficient LLM Alignment

Hao Chen, Qi Zhang, Liyao Li, Zhanming Shen, Wentao Ye, Lirong Gao, Ningtao Wang, Xing Fu, Xiaoyu Shen, Junbo Zhao

PDF

TL;DR

This paper introduces P2D, a unified framework that leverages task-sensitive attention heads in LLMs to efficiently select data and fine-tune models, significantly reducing time and resources needed for domain adaptation.

Contribution

The paper proposes a novel task-parameter-guided pipeline that couples attention head analysis with data selection, achieving efficient LLM alignment with minimal parameter updates.

Findings

01

Updating 10% of attention heads on 10% of data yields 8.3 pp performance gain.

02

P2D achieves a 7.0x end-to-end speedup in the alignment process.

03

The framework validates that parameter-data synchronization reduces redundancy.

Abstract

Adapting Large Language Models (LLMs) to specialized domains typically incurs high data and computational overhead. While prior efficiency efforts have largely treated data selection and parameter-efficient fine-tuning as isolated processes, our empirical analysis suggests they may be intrinsically coupled. We posit the Strong Map Hypothesis: a sparse subset of attention heads plays a dominant role in task-specific adaptation, acting as keys that unlock specific data patterns. Building on this observation, we propose From Parameters to Data (P2D), a unified framework that leverages these task-sensitive attention heads as a dual compass for both sample mining and structural pruning. To rigorously quantify the total pipeline cost, we introduce the Alignment Efficiency Ratio (AER) metric for both selection latency and training time. Mechanistically, P2D identifies critical heads via a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.