ReviewInstruct: A Review-Driven Multi-Turn Conversations Generation Method for Large Language Models

Jiangxu Wu; Cong Wang; TianHuang Su; Jun Yang; Haozhi Lin; Chao Zhang; Ming Peng; Kai Shi; SongPan Yang; BinQing Pan; ZiXian Li; Ni Yang; ZhenYu Yang

arXiv:2505.11010·cs.CL·July 8, 2025

TL;DR

ReviewInstruct introduces an iterative multi-agent framework that synthesizes diverse, high-quality multi-turn dialogues for large language models, significantly improving their contextual coherence and instruction quality.

Contribution

It proposes a novel review-driven multi-agent method for generating multi-turn dialogue data, enhancing diversity and difficulty for LLM fine-tuning.

Findings

01 Achieves a 2.9% absolute improvement on MMLU-Pro

02 Achieves a 2% absolute improvement on MT-Bench

03 Demonstrates the effectiveness of review stages and multiple reviewers

Abstract

The effectiveness of large language models (LLMs) in conversational AI is hindered by their reliance on single-turn supervised fine-tuning (SFT) data, which limits contextual coherence in multi-turn dialogues. Existing methods for generating multi-turn dialogue data struggle to ensure both diversity and quality in instructions. To address this, we propose Review-Instruct, a novel framework that synthesizes multi-turn conversations through an iterative "Ask-Respond-Review" process involving three agent roles: a Candidate, multiple Reviewers, and a Chairman. The framework iteratively refines instructions by incorporating Reviewer feedback, enhancing dialogue diversity and difficulty. We construct a multi-turn dataset using the Alpaca dataset and fine-tune the LLaMA2-13B model. Evaluations on MT-Bench, MMLU-Pro, and Auto-Arena demonstrate significant improvements, achieving absolute gains…
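The iterative "Ask-Respond-Review" loop described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation. The abstract names the three roles but does not say which one writes each instruction, so the division of labour here is an assumption: the Chairman is assumed to turn Reviewer feedback into the next instruction. The function names, signatures, the `Turn` record, and the toy agents are all hypothetical, and in practice each callable would wrap an LLM call.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Turn:
    """One round of the dialogue plus the feedback collected on it."""
    instruction: str
    response: str
    reviews: List[str] = field(default_factory=list)


# Hypothetical agent signatures (assumptions, not taken from the paper).
Candidate = Callable[[List[Turn], str], str]        # (history, instruction) -> response
Reviewer = Callable[[List[Turn], str, str], str]    # (history, instruction, response) -> feedback
Chairman = Callable[[List[Turn], List[str]], str]   # (history, reviews) -> next instruction


def review_instruct_dialogue(
    seed_instruction: str,
    candidate: Candidate,
    reviewers: List[Reviewer],
    chairman: Chairman,
    num_rounds: int = 3,
) -> List[Turn]:
    """Build a multi-turn dialogue from a single seed instruction."""
    history: List[Turn] = []
    instruction = seed_instruction
    for _ in range(num_rounds):
        # Respond: the Candidate answers the current instruction.
        response = candidate(history, instruction)
        # Review: every Reviewer comments on the latest exchange.
        reviews = [r(history, instruction, response) for r in reviewers]
        history.append(Turn(instruction, response, reviews))
        # Ask: the Chairman (assumed role) folds the feedback into the next instruction.
        instruction = chairman(history, reviews)
    return history


# Toy stand-ins so the sketch runs without any model.
def toy_candidate(history: List[Turn], instruction: str) -> str:
    return f"answer to: {instruction}"


def toy_reviewer_depth(history: List[Turn], instruction: str, response: str) -> str:
    return "ask for more depth"


def toy_reviewer_breadth(history: List[Turn], instruction: str, response: str) -> str:
    return "ask about a related topic"


def toy_chairman(history: List[Turn], reviews: List[str]) -> str:
    return f"follow-up {len(history)} ({'; '.join(reviews)})"


if __name__ == "__main__":
    dialogue = review_instruct_dialogue(
        "Explain overfitting.",
        toy_candidate,
        [toy_reviewer_depth, toy_reviewer_breadth],
        toy_chairman,
        num_rounds=3,
    )
    for turn in dialogue:
        print(turn.instruction, "->", turn.response)
```

Passing the agents in as plain callables keeps the loop independent of any particular model or prompt format. A seed instruction from a single-turn dataset such as Alpaca would play the role of `seed_instruction`.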

Code & Models

Repositories

wjx-git/Review-Instruct · Official

Taxonomy

Topics: Topic Modeling · Text Readability and Simplification · Natural Language Processing Techniques