Beyond IID: Optimizing Instruction Learning from the Perspective of Instruction Interaction and Dependency

Hanyu Zhao; Li Du; Yiming Ju; Chengwei Wu; Tengfei Pan

arXiv:2409.07045·cs.CL·September 12, 2024

TL;DR

This paper explores how interactions and dependencies among diverse instruction types affect fine-tuning large language models, proposing methods to optimize instruction sets and learning schemas for improved performance.

Contribution

It introduces a systematic analysis of instruction interaction patterns and develops optimization techniques using linear programming and curriculum learning.
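To make the curriculum idea concrete, here is a minimal sketch of one way dependencies between instruction categories could be turned into a training order. It is not the authors' procedure, which this summary does not describe. The category names, the `depends_on` edges, and the `curriculum_stages` helper are all hypothetical, and the staging rule (train a category only after the categories it depends on) is an assumption.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+


def curriculum_stages(depends_on):
    """Group categories into stages so that every category appears
    in a later stage than all categories it depends on.

    depends_on maps a category to the set of categories it needs first.
    Raises graphlib.CycleError if the dependencies are circular.
    """
    sorter = TopologicalSorter(depends_on)
    sorter.prepare()
    stages = []
    while sorter.is_active():
        # Everything whose prerequisites are already scheduled.
        ready = sorted(sorter.get_ready())
        stages.append(ready)
        sorter.done(*ready)
    return stages


# Hypothetical dependency edges, for illustration only.
depends_on = {
    "arithmetic": set(),
    "chat": set(),
    "reasoning": {"arithmetic"},
    "code": {"reasoning"},
}

stages = curriculum_stages(depends_on)
for index, stage in enumerate(stages, start=1):
    print(f"stage {index}: {', '.join(stage)}")
```

Each stage could then be fine-tuned in sequence, or used to weight sampling toward earlier stages at the start of training; which of these, if either, matches the paper is not stated here.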

Findings

01. Enhanced LLM performance on benchmarks

02. Effective instruction set optimization methods

03. Insights into instruction interaction patterns

Abstract

With the availability of various instruction datasets, a pivotal challenge is how to effectively select and integrate these instructions to fine-tune large language models (LLMs). Previous research mainly focuses on selecting individual high-quality instructions. However, these works overlook the joint interactions and dependencies between different categories of instructions, leading to suboptimal selection strategies. Moreover, the nature of these interaction patterns remains largely unexplored, let alone how to optimize the instruction set with respect to them. To fill these gaps, in this paper, we: (1) systematically investigate interaction and dependency patterns between different categories of instructions, (2) optimize the instruction set with respect to these interaction patterns using a linear programming-based method, and optimize the learning schema of SFT using an instruction…
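As a rough illustration of what a linear programming view of instruction set optimization can look like, the sketch below chooses mixing proportions over instruction categories. It is not the paper's formulation, which the abstract does not spell out. The `effects` matrix, the per-category bounds, and the `solve_mixture_lp` helper are hypothetical, and the numbers are invented toy values. The assumed problem is: maximize the sum of `scores[c] * x[c]` subject to the proportions summing to the budget and each `x[c]` staying within its bounds. For this particular LP a greedy fill is exact, so no solver is used; richer constraints would need a general LP solver.

```python
def solve_mixture_lp(scores, lower, upper, budget=1.0):
    """Maximize sum(scores[c] * x[c]) subject to
    sum(x) == budget and lower[c] <= x[c] <= upper[c].

    With a single budget constraint and box bounds, filling the
    highest-scoring categories first yields the LP optimum.
    """
    x = dict(lower)
    remaining = budget - sum(x.values())
    if remaining < -1e-12:
        raise ValueError("lower bounds exceed the budget")
    for c in sorted(scores, key=scores.get, reverse=True):
        take = min(upper[c] - x[c], remaining)
        x[c] += take
        remaining -= take
    if remaining > 1e-9:
        raise ValueError("upper bounds cannot absorb the budget")
    return x


# Hypothetical effects[a][b]: change in category b's benchmark score
# when more data from category a is added (toy values, not results).
effects = {
    "math": {"math": 0.5, "code": 0.3, "chat": -0.1},
    "code": {"math": 0.2, "code": 0.4, "chat": 0.0},
    "chat": {"math": -0.2, "code": -0.1, "chat": 0.6},
}

# Assumed linear score: total effect of a category across all targets.
scores = {c: sum(row.values()) for c, row in effects.items()}
lower = {c: 0.1 for c in effects}  # keep at least 10% of each category
upper = {c: 0.5 for c in effects}  # cap any one category at 50%

mixture = solve_mixture_lp(scores, lower, upper)
for c, share in mixture.items():
    print(f"{c}: {share:.2f}")
```

Collapsing the interaction matrix into one score per category is the simplifying assumption that keeps the objective linear; it ignores any effect that depends on pairs of proportions at once.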

Taxonomy

Topics: Online and Blended Learning · Education and Learning Interventions · Online Learning and Analytics

Methods: Sparse Evolutionary Training · Shrink and Fine-Tune