Automatic Instruction Evolving for Large Language Models

Weihao Zeng; Can Xu; Yingxiu Zhao; Jian-Guang Lou; Weizhu Chen

arXiv:2406.00770·cs.CL·June 4, 2024·3 cites

Automatic Instruction Evolving for Large Language Models

Weihao Zeng, Can Xu, Yingxiu Zhao, Jian-Guang Lou, Weizhu Chen

PDF

Open Access 1 Repo 1 Video

TL;DR

Auto Evol-Instruct is an automated framework that evolves instruction datasets for large language models, outperforming human-designed methods without requiring human expertise.

Contribution

It introduces an end-to-end automated approach for evolving instruction datasets using large language models, reducing reliance on human-designed strategies.

Findings

01

Outperforms human-designed methods on multiple benchmarks

02

Automatically analyzes and summarizes evolutionary strategies

03

Iteratively improves instruction evolution process

Abstract

Fine-tuning large pre-trained language models with Evol-Instruct has achieved encouraging results across a wide range of tasks. However, designing effective evolving methods for instruction evolution requires substantial human expertise. This paper proposes Auto Evol-Instruct, an end-to-end framework that evolves instruction datasets using large language models without any human effort. The framework automatically analyzes and summarizes suitable evolutionary strategies for the given instruction data and iteratively improves the evolving method based on issues exposed during the instruction evolution process. Our extensive experiments demonstrate that the best method optimized by Auto Evol-Instruct outperforms human-designed methods on various benchmarks, including MT-Bench, AlpacaEval, GSM8K, and HumanEval.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

arcee-ai/EvolKit
none

Videos

Automatic Instruction Evolving for Large Language Models· underline

Taxonomy

TopicsIntelligent Tutoring Systems and Adaptive Learning · Natural Language Processing Techniques · Online Learning and Analytics