InstructDiff: Domain-Adaptive Data Selection via Differential Entropy for Efficient LLM Fine-Tuning

Junyou Su; He Zhu; Xiao Luo; Liyu Zhang; Hong-Yu Zhou; Yun Chen; Peng Li; Yang Liu; Guanhua Chen

arXiv:2601.23006·cs.CL·February 2, 2026

InstructDiff: Domain-Adaptive Data Selection via Differential Entropy for Efficient LLM Fine-Tuning

Junyou Su, He Zhu, Xiao Luo, Liyu Zhang, Hong-Yu Zhou, Yun Chen, Peng Li, Yang Liu, Guanhua Chen

PDF

Open Access

TL;DR

InstructDiff is a domain-adaptive data selection method that uses differential entropy to efficiently fine-tune large language models, significantly reducing data requirements while improving performance across reasoning and general tasks.

Contribution

The paper introduces InstructDiff, a novel entropy-based data selection framework that adapts to domain-specific tasks, enhancing fine-tuning efficiency and effectiveness.

Findings

01

Achieves 17% improvement on mathematical reasoning tasks.

02

Achieves 52% improvement on general instruction-following tasks.

03

Uses only 10% of the data compared to full dataset training.

Abstract

Supervised fine-tuning (SFT) is fundamental to adapting large language models, yet training on complete datasets incurs prohibitive costs with diminishing returns. Existing data selection methods suffer from severe domain specificity: techniques optimized for general instruction-following fail on reasoning tasks, and vice versa. We observe that measuring entropy differences between base models and minimally instruction-tuned calibrated models reveals a pattern -- samples with the lowest differential entropy consistently yield optimal performance across domains, yet this principle manifests domain-adaptively: reasoning tasks favor entropy increase (cognitive expansion), while general tasks favor entropy decrease (cognitive compression). We introduce InstructDiff, a unified framework that operationalizes differential entropy as a domain-adaptive selection criterion through warmup…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Machine Learning and Data Classification