Ada-Instruct: Adapting Instruction Generators for Complex Reasoning

Wanyun Cui; Qianle Wang

arXiv:2310.04484·cs.CL·October 4, 2024

Ada-Instruct: Adapting Instruction Generators for Complex Reasoning

Wanyun Cui, Qianle Wang

PDF

Open Access 1 Repo 10 Models 1 Datasets 3 Reviews

TL;DR

Ada-Instruct enhances instruction generation for complex reasoning tasks by fine-tuning open source LLMs with minimal data, enabling the creation of long, intricate instructions that surpass previous methods in complexity and consistency.

Contribution

This paper introduces Ada-Instruct, a novel fine-tuning approach that produces complex, long instructions for reasoning tasks using only ten examples, addressing limitations of prior self-instruct methods.

Findings

01

Ada-Instruct generates instructions of length ≥ 100 for complex tasks.

02

It maintains distributional consistency across diverse applications.

03

It outperforms existing methods in creating intricate instructions.

Abstract

Instructions augmentation is a crucial step for unleashing the full potential of large language models (LLMs) in downstream tasks. Existing Self-Instruct methods primarily simulate new instructions from a few initial instructions with in-context learning. However, our study identifies a critical flaw in this approach: even with GPT4o, Self-Instruct cannot generate complex instructions of length $\geq 100$ , which is necessary in complex tasks such as code completion. To address this issue, our key insight is that fine-tuning open source LLMs with only ten examples can produce complex instructions that maintain distributional consistency for complex reasoning tasks. We introduce Ada-Instruct, an adaptive instruction generator developed through fine-tuning. We empirically validated Ada-Instruct's efficacy across different applications. The results highlight Ada-Instruct's capacity to…

Peer Reviews

Decision·Submitted to ICLR 2024

Reviewer 01Rating 5· marginally below the acceptance thresholdConfidence 3

Strengths

1. This paper proposes a novel self-instruct method by finetuning open-sourced LLMs to generate instruction. 2. The insight is impressive that current self-instruct methods (ICL) prefer to generate short instructions which will lead to a distribution mismatch. 3. The paper is well written.

Weaknesses

1. In terms of innovation, the authors seem to have some misconceptions. Specifically, there have been previous works that used open-source models to generate instructions, such as the use of the open-sourced LLM Llama in [1], rather than ChatGPT or GPT-4. So what the authors mentioned in the introduction is not true: >A prevalent approach is called “self-instruct” (Wang et al., 2022), which involves having ChatGPT sequentially generate both instructions and answers (Sun et al., 2023; Peng et

Reviewer 02Rating 6· marginally above the acceptance thresholdConfidence 4

Strengths

The proposed method is very simple and effective, and gets to generate diverse, complex queries for constructing instruction tuning datasets. This is an effective method to extrapolate training data from models to improve domain specific instruction tuning.

Weaknesses

**Fair comparison is lacking**: The Table 1 does not present an apple-to-apple comparison, where Code LLAMA-Insturct utilizes different amount of data from Ada-Instruct-HumanEval or Ada-Instruct-MBPP. A fair comparison will be to compare self-instruct directly with Ada-instruct by controlling the amount of initial data and SFT data. **Comparison to Evo-instruct is lacking**: Though Evo-instruct seems to generat unnatural prompt, it has shown significant improvement over normal prompting. It’

Reviewer 03Rating 5· marginally below the acceptance thresholdConfidence 3

Strengths

- The proposed Ada-Instruct method leverages open-source models for instruction generation, reducing reliance on closed-source large models, which can lower the cost of training task-specific models. - Ada-Instruct outperforms self-instruct on well-controlled math and commonsense reasoning tasks, highlighting the method's effectiveness. - The paper compares the fine-tuned model with instructions generated via self-instruction, particularly exploring instruction quality and the impact on SFT (sup

Weaknesses

- The exploration of which instructions are useful for sft is not sufficiently clear. - The paper initially points out that the issue with self-instruction is the limited length of generated instructions. However, later experiments show that Evol-Instruct with longer instructions does not perform well. The authors attribute this to "unnatural" instructions that do not align with downstream task distributions, but lack experimental validation. The authors can rewrite these instructions with op

Code & Models

Repositories

wangitu/ada-instruct
pytorchOfficial

Models

Datasets

ahsanirfan961/genstruct-output
dataset· 107 dl
107 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Software Engineering Research

MethodsBalanced Selection