AgentInstruct: Toward Generative Teaching with Agentic Flows
Arindam Mitra, Luciano Del Corro, Guoqing Zheng, Shweti Mahajan, Dany, Rouhana, Andres Codas, Yadong Lu, Wei-ge Chen, Olga Vrousgos, Corby Rosset,, Fillipe Silva, Hamed Khanpour, Yash Lara, Ahmed Awadallah

TL;DR
This paper introduces AgentInstruct, an agentic framework for automatically generating high-quality synthetic data to improve language model training, demonstrating significant performance gains across multiple benchmarks.
Contribution
The paper presents AgentInstruct, a novel extensible framework for automatic synthetic data creation for language models, reducing human effort and enhancing diversity and quality.
Findings
Generated 25 million data pairs for instruction tuning.
Post-trained Mistral-7b with the synthetic data, achieving significant benchmark improvements.
Outperformed models like LLAMA-8B-instruct and GPT-3.5-turbo.
Abstract
Synthetic data is becoming increasingly important for accelerating the development of language models, both large and small. Despite several successful use cases, researchers also raised concerns around model collapse and drawbacks of imitating other models. This discrepancy can be attributed to the fact that synthetic data varies in quality and diversity. Effective use of synthetic data usually requires significant human effort in curating the data. We focus on using synthetic data for post-training, specifically creating data by powerful models to teach a new skill or behavior to another model, we refer to this setting as Generative Teaching. We introduce AgentInstruct, an extensible agentic framework for automatically creating large amounts of diverse and high-quality synthetic data. AgentInstruct can create both the prompts and responses, using only raw data sources like text…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMulti-Agent Systems and Negotiation
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · {Dispute@FaQ-s}How to file a dispute with Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Linear Layer · Adam · Dropout · Dense Connections · Weight Decay · Multi-Head Attention
