LLM-Driven Multi-Turn Task-Oriented Dialogue Synthesis for Realistic Reasoning

Yu Zhu; Kai Yang

arXiv:2602.23610·cs.CL·March 2, 2026

LLM-Driven Multi-Turn Task-Oriented Dialogue Synthesis for Realistic Reasoning

Yu Zhu, Kai Yang

PDF

Open Access 1 Video

TL;DR

This paper introduces a novel LLM-driven framework for synthesizing multi-turn, task-oriented dialogues that reflect real-world reasoning scenarios, aiming to improve the evaluation and development of LLM reasoning abilities.

Contribution

It presents a trilevel optimization-based method for generating realistic, coherent dialogues and reasoning tasks, addressing limitations of existing benchmarks and data contamination issues.

Findings

01

Synthetic datasets introduce complex reasoning challenges.

02

Generated dialogues are contextually coherent and realistic.

03

Method enhances LLM reasoning evaluation and training.

Abstract

The reasoning capability of large language models (LLMs), defined as their ability to analyze, infer, and make decisions based on input information, is essential for building intelligent task-oriented dialogue systems. However, existing benchmarks do not sufficiently reflect the complexity of real-world scenarios, which limits their effectiveness in evaluating and enhancing LLM reasoning in practical contexts. Many current reasoning datasets are overly simplistic and abstract, often disconnected from realistic task flows, domain constraints, and operational rules, making it difficult to effectively evaluate LLMs' logical reasoning ability. In addition, data contamination from pretraining corpora undermines the reliability of evaluation results, and traditional crowdsourcing methods for dataset construction are labor-intensive and difficult to scale. To address these challenges, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

LLM-Driven Multi-Turn Task-Oriented Dialogue Synthesis for Realistic Reasoning· underline

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Multimodal Machine Learning Applications