Phased Instruction Fine-Tuning for Large Language Models

Wei Pang; Chuan Zhou; Xiao-Hua Zhou; Xiaojie Wang

arXiv:2406.04371·cs.CL·June 18, 2024·1 cites

Phased Instruction Fine-Tuning for Large Language Models

Wei Pang, Chuan Zhou, Xiao-Hua Zhou, Xiaojie Wang

PDF

Open Access 1 Repo

TL;DR

This paper introduces Phased Instruction Fine-Tuning, a progressive approach that improves large language models' adherence to instructions by sequentially training on instruction subsets ordered by difficulty, outperforming traditional one-off methods.

Contribution

The paper proposes a novel phased fine-tuning method that enhances instruction-following by gradually increasing instruction difficulty, supported by experiments on multiple models and datasets.

Findings

01

Phased IFT significantly outperforms One-off IFT.

02

Progressive training supports the hypothesis of gradual instruction alignment.

03

Method is simple, efficient, and broadly applicable.

Abstract

Instruction Fine-Tuning enhances pre-trained language models from basic next-word prediction to complex instruction-following. However, existing One-off Instruction Fine-Tuning (One-off IFT) method, applied on a diverse instruction, may not effectively boost models' adherence to instructions due to the simultaneous handling of varying instruction complexities. To improve this, Phased Instruction Fine-Tuning (Phased IFT) is proposed, based on the idea that learning to follow instructions is a gradual process. It assesses instruction difficulty using GPT-4, divides the instruction data into subsets of increasing difficulty, and uptrains the model sequentially on these subsets. Experiments with Llama-2 7B/13B/70B, Llama3 8/70B and Mistral-7B models using Alpaca data show that Phased IFT significantly outperforms One-off IFT, supporting the progressive alignment hypothesis and providing a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xubuvd/phasedsft
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques

MethodsResidual Connection · Softmax · Layer Normalization · Byte Pair Encoding · Label Smoothing · Adam · Attention Is All You Need · Linear Layer · Multi-Head Attention · Position-Wise Feed-Forward Layer