Chain-of-Instructions: Compositional Instruction Tuning on Large   Language Models

Shirley Anugrah Hayati; Taehee Jung; Tristan Bodding-Long; Sudipta; Kar; Abhinav Sethy; Joo-Kyung Kim; Dongyeop Kang

arXiv:2402.11532·cs.CL·January 7, 2025·2 cites

Chain-of-Instructions: Compositional Instruction Tuning on Large Language Models

Shirley Anugrah Hayati, Taehee Jung, Tristan Bodding-Long, Sudipta, Kar, Abhinav Sethy, Joo-Kyung Kim, Dongyeop Kang

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces chain-of-instructions (CoI), a compositional instruction tuning method for large language models that enhances their ability to handle complex, multi-step, and unseen tasks by training them on chained subtasks.

Contribution

The paper proposes a novel CoI-tuning approach that improves LLMs' performance on complex and unseen multi-step instructions by leveraging chained subtasks during training.

Findings

01

CoI-tuning enhances handling of multi-subtask instructions

02

Improves generalization to unseen composite tasks

03

Enables better performance on complex, longer instruction chains

Abstract

Fine-tuning large language models (LLMs) with a collection of large and diverse instructions has improved the model's generalization to different tasks, even for unseen tasks. However, most existing instruction datasets include only single instructions, and they struggle to follow complex instructions composed of multiple subtasks. In this work, we propose a novel concept of compositional instructions called chain-of-instructions (CoI), where the output of one instruction becomes an input for the next like a chain. Unlike the conventional practice of solving single instruction tasks, our proposed method encourages a model to solve each subtask step by step until the final answer is reached. CoI-tuning (i.e., fine-tuning with CoI instructions) improves the model's ability to handle instructions composed of multiple subtasks as well as unseen composite tasks such as multilingual…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

amazon-science/chain-of-instructions
pytorchOfficial

Videos

Chain-of-Instructions: Compositional Instruction Tuning on Large Language Models· underline

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling