Exploring Format Consistency for Instruction Tuning

Shihao Liang; Runchu Tian; Kunlun Zhu; Yujia Qin; Huadong Wang; Xin; Cong; Zhiyuan Liu; Xiaojiang Liu; Maosong Sun

arXiv:2307.15504·cs.CL·January 9, 2024

Exploring Format Consistency for Instruction Tuning

Shihao Liang, Runchu Tian, Kunlun Zhu, Yujia Qin, Huadong Wang, Xin, Cong, Zhiyuan Liu, Xiaojiang Liu, Maosong Sun

PDF

Open Access 1 Repo

TL;DR

This paper introduces a framework called Unified Instruction Tuning (UIT) that uses automatic format transfer to improve instruction tuning consistency and performance across diverse datasets for large language models.

Contribution

The paper presents a novel framework for automatic format transfer in instruction tuning, demonstrating its importance and effectiveness in enhancing model generalization.

Findings

01

Format consistency improves generalization performance.

02

The proposed denoising method reduces transfer noise effectively.

03

A smaller offline model achieves comparable transfer capabilities.

Abstract

Instruction tuning has emerged as a promising approach to enhancing large language models in following human instructions. It is shown that increasing the diversity and number of instructions in the training data can consistently enhance generalization performance, which facilitates a recent endeavor to collect various instructions and integrate existing instruction tuning datasets into larger collections. However, different users have their unique ways of expressing instructions, and there often exist variations across different datasets in the instruction styles and formats, i.e., format inconsistency. In this work, we propose a framework named Unified Instruction Tuning (UIT), which calls OpenAI APIs for automatic format transfer among different instruction tuning datasets such as PromptSource, FLAN and CrossFit. With the framework, we (1) demonstrate the necessity of maintaining…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

thunlp/unifiedinstructiontuning
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis