Towards Zero-Shot, Controllable Dialog Planning with LLMs
Dirk V\"ath, Ngoc Thang Vu

TL;DR
This paper presents a zero-shot, controllable dialog planning method using LLMs and dialog graphs, outperforming RL-based agents in simulation and real user tasks, especially in sensitive domains.
Contribution
Introduces a novel zero-shot approach for controllable Conversational Tree Search using LLM-guided graph search, eliminating the need for extensive training.
Findings
Significantly outperforms state-of-the-art CTS agents in simulation ($p<0.0001$).
Improves task success in real user evaluation ($p<0.05$).
Generalizes across all CTS domains.
Abstract
Recently, Large Language Models (LLMs) have emerged as an alternative to training task-specific dialog agents, due to their broad reasoning capabilities and performance in zero-shot learning scenarios. However, many LLM-based dialog systems fall short in planning towards an overarching dialog goal and therefore cannot steer the conversation appropriately. Furthermore, these models struggle with hallucination, making them unsuitable for information access in sensitive domains, such as legal or medical domains, where correctness of information given to users is critical. The recently introduced task Conversational Tree Search (CTS) proposes the use of dialog graphs to avoid hallucination in sensitive domains, however, state-of-the-art agents are Reinforcement Learning (RL) based and require long training times, despite excelling at dialog strategy. This paper introduces a novel zero-shot…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Games
