LLM-Driven Multi-Turn Task-Oriented Dialogue Synthesis for Realistic Reasoning
Yu Zhu, Kai Yang

TL;DR
This paper introduces a novel LLM-driven framework for synthesizing multi-turn, task-oriented dialogues that reflect real-world reasoning scenarios, aiming to improve the evaluation and development of LLM reasoning abilities.
Contribution
It presents a trilevel optimization-based method for generating realistic, coherent dialogues and reasoning tasks, addressing limitations of existing benchmarks and data contamination issues.
Findings
Synthetic datasets introduce complex reasoning challenges.
Generated dialogues are contextually coherent and realistic.
Method enhances LLM reasoning evaluation and training.
Abstract
The reasoning capability of large language models (LLMs), defined as their ability to analyze, infer, and make decisions based on input information, is essential for building intelligent task-oriented dialogue systems. However, existing benchmarks do not sufficiently reflect the complexity of real-world scenarios, which limits their effectiveness in evaluating and enhancing LLM reasoning in practical contexts. Many current reasoning datasets are overly simplistic and abstract, often disconnected from realistic task flows, domain constraints, and operational rules, making it difficult to effectively evaluate LLMs' logical reasoning ability. In addition, data contamination from pretraining corpora undermines the reliability of evaluation results, and traditional crowdsourcing methods for dataset construction are labor-intensive and difficult to scale. To address these challenges, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTopic Modeling · Speech and dialogue systems · Multimodal Machine Learning Applications
