Ada-Instruct: Adapting Instruction Generators for Complex Reasoning
Wanyun Cui, Qianle Wang

TL;DR
Ada-Instruct enhances instruction generation for complex reasoning tasks by fine-tuning open source LLMs with minimal data, enabling the creation of long, intricate instructions that surpass previous methods in complexity and consistency.
Contribution
This paper introduces Ada-Instruct, a novel fine-tuning approach that produces complex, long instructions for reasoning tasks using only ten examples, addressing limitations of prior self-instruct methods.
Findings
Ada-Instruct generates instructions of length ≥ 100 for complex tasks.
It maintains distributional consistency across diverse applications.
It outperforms existing methods in creating intricate instructions.
Abstract
Instructions augmentation is a crucial step for unleashing the full potential of large language models (LLMs) in downstream tasks. Existing Self-Instruct methods primarily simulate new instructions from a few initial instructions with in-context learning. However, our study identifies a critical flaw in this approach: even with GPT4o, Self-Instruct cannot generate complex instructions of length , which is necessary in complex tasks such as code completion. To address this issue, our key insight is that fine-tuning open source LLMs with only ten examples can produce complex instructions that maintain distributional consistency for complex reasoning tasks. We introduce Ada-Instruct, an adaptive instruction generator developed through fine-tuning. We empirically validated Ada-Instruct's efficacy across different applications. The results highlight Ada-Instruct's capacity to…
Peer Reviews
Decision·Submitted to ICLR 2024
1. This paper proposes a novel self-instruct method by finetuning open-sourced LLMs to generate instruction. 2. The insight is impressive that current self-instruct methods (ICL) prefer to generate short instructions which will lead to a distribution mismatch. 3. The paper is well written.
1. In terms of innovation, the authors seem to have some misconceptions. Specifically, there have been previous works that used open-source models to generate instructions, such as the use of the open-sourced LLM Llama in [1], rather than ChatGPT or GPT-4. So what the authors mentioned in the introduction is not true: >A prevalent approach is called “self-instruct” (Wang et al., 2022), which involves having ChatGPT sequentially generate both instructions and answers (Sun et al., 2023; Peng et
The proposed method is very simple and effective, and gets to generate diverse, complex queries for constructing instruction tuning datasets. This is an effective method to extrapolate training data from models to improve domain specific instruction tuning.
**Fair comparison is lacking**: The Table 1 does not present an apple-to-apple comparison, where Code LLAMA-Insturct utilizes different amount of data from Ada-Instruct-HumanEval or Ada-Instruct-MBPP. A fair comparison will be to compare self-instruct directly with Ada-instruct by controlling the amount of initial data and SFT data. **Comparison to Evo-instruct is lacking**: Though Evo-instruct seems to generat unnatural prompt, it has shown significant improvement over normal prompting. It’
- The proposed Ada-Instruct method leverages open-source models for instruction generation, reducing reliance on closed-source large models, which can lower the cost of training task-specific models. - Ada-Instruct outperforms self-instruct on well-controlled math and commonsense reasoning tasks, highlighting the method's effectiveness. - The paper compares the fine-tuned model with instructions generated via self-instruction, particularly exploring instruction quality and the impact on SFT (sup
- The exploration of which instructions are useful for sft is not sufficiently clear. - The paper initially points out that the issue with self-instruction is the limited length of generated instructions. However, later experiments show that Evol-Instruct with longer instructions does not perform well. The authors attribute this to "unnatural" instructions that do not align with downstream task distributions, but lack experimental validation. The authors can rewrite these instructions with op
Code & Models
- 🤗NousResearch/Genstruct-7Bmodel· 66 dl· ♡ 40366 dl♡ 403
- 🤗Goekdeniz-Guelmez/NousResearch-Genstruct-7B-GGUFmodel· 97 dl· ♡ 397 dl♡ 3
- 🤗Goekdeniz-Guelmez/NousResearch-Genstruct-7B-only-GGUFmodel· 61 dl· ♡ 161 dl♡ 1
- 🤗koesn/Genstruct-7B-GGUFmodel· 15 dl15 dl
- 🤗solidrust/Genstruct-7B-AWQmodel· 3 dl· ♡ 13 dl♡ 1
- 🤗Joseph717171/Genstruct-10.7Bmodel· 8 dl8 dl
- 🤗RichardErkhov/NousResearch_-_Genstruct-7B-4bitsmodel· 2 dl2 dl
- 🤗RichardErkhov/NousResearch_-_Genstruct-7B-8bitsmodel· 2 dl2 dl
- 🤗RichardErkhov/NousResearch_-_Genstruct-7B-ggufmodel· 78 dl78 dl
- 🤗iknow-lab/ko-genstruct-v0.1model· 4 dl· ♡ 24 dl♡ 2
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Software Engineering Research
MethodsBalanced Selection
