Fine-tuning Large Language Models with Sequential Instructions

Hanxu Hu; Simon Yu; Pinzhen Chen; Edoardo M. Ponti

arXiv:2403.07794 · cs.CL · July 4, 2024 · 1 citation

TL;DR

This paper introduces a sequential instruction tuning method for large language models, improving their performance on complex multi-step tasks by incorporating chains of interrelated instructions and proposing a new evaluation benchmark.

Contribution

The paper proposes a sequential instruction tuning approach, automates the construction of sequential instructions from existing datasets (e.g., Alpaca and FlanCoT), and introduces SeqEval, a benchmark for assessing multi-instruction following capabilities.

Findings

01. Enhanced performance in coding, maths, and open-ended tasks
02. Improved ability to follow complex instruction sequences
03. Introduced a new benchmark for sequential instruction evaluation

Abstract

Despite the success of existing instruction-tuned models, we find that they usually struggle to respond to queries with multiple instructions. This impairs their performance in complex problems whose solution consists of multiple intermediate tasks. Thus, we contend that part of the fine-tuning data mixture should be sequential--containing a chain of interrelated tasks. We first approach sequential instruction tuning from a task-driven perspective, manually creating interpretable intermediate tasks for multilingual and visual question answering: namely "translate then predict" and "caption then answer". Next, we automate this process by turning instructions in existing datasets (e.g., Alpaca and FlanCoT) into diverse and complex sequential instructions, making our method general-purpose. Models that underwent our sequential instruction tuning show improved results in coding, maths, and…
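To make the "translate then predict" idea concrete, here is a minimal sketch of how a single-instruction record might be turned into a two-step sequential one. It is an illustration only, not the authors' pipeline: the `Example` layout (an Alpaca-like instruction/input/output triple), the helper names `chain_instructions` and `translate_then_predict`, and the "Translation:" / "Answer:" output template are all assumptions made for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Example:
    """One instruction-tuning record in an Alpaca-like layout (assumed)."""
    instruction: str
    input: str
    output: str


def chain_instructions(first: str, second: str) -> str:
    """Join two task descriptions into a single two-step instruction."""
    return f"First, {first.rstrip('.')}. Then, {second.rstrip('.')}."


def translate_then_predict(example: Example, translation: str) -> Example:
    """Hypothetical helper: prepend a translation step to an existing task.

    `translation` must be supplied by the caller (for instance, a reference
    translation); this sketch does not translate anything itself.
    """
    # Lower-case the first letter so the original instruction reads
    # naturally after "Then, ".
    second = example.instruction[:1].lower() + example.instruction[1:]
    instruction = chain_instructions("translate the input into English", second)
    # The target holds the intermediate result followed by the final answer.
    output = f"Translation: {translation}\nAnswer: {example.output}"
    return Example(instruction, example.input, output)


original = Example(
    instruction="Answer the question.",
    input="Quelle est la capitale de la France ?",
    output="Paris",
)
sequential = translate_then_predict(original, "What is the capital of France?")
print(sequential.instruction)
print(sequential.output)
```

The key design point in this sketch is that the training target contains the intermediate result as well as the final answer, so the model is supervised on every step of the chain, not only the last one.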

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Machine Learning and Algorithms