Evaluating and Enhancing Out-of-Domain Generalization of Task-Oriented Dialog Systems for Task Completion without Turn-level Dialog Annotations
Adib Mosharrof, Moghis Fereidouni, A.B. Siddique

TL;DR
This paper investigates fine-tuning large language models on natural dialogs without turn-level annotations to improve out-of-domain task-oriented dialog systems, proposing a schema augmentation framework to enhance task completion.
Contribution
It introduces ZeroToD, a novel schema augmentation framework that improves API call accuracy and task completion in out-of-domain dialogs without requiring turn-level annotations.
Findings
Fine-tuned models generate coherent responses but have limited task success (~53%) in unseen domains.
ZeroToD improves API call accuracy and outperforms prompting-based approaches.
Human evaluation confirms the effectiveness of the proposed framework.
Abstract
Traditional task-oriented dialog (ToD) systems rely heavily on labor-intensive turn-level annotations, such as dialogue states and policy labels, for training. This work explores whether large language models (LLMs) can be fine-tuned solely on natural language dialogs to perform ToD tasks, without requiring such annotations. We evaluate their ability to generalize to unseen domains and compare their performance with models trained on fully annotated data. Through extensive experiments with three open-source LLMs of varying sizes and two diverse ToD datasets, we find that models fine-tuned without turn-level annotations generate coherent and contextually appropriate responses. However, their task completion performance - measured by accurate execution of API calls - remains suboptimal, with the best models achieving only around 53% success in unseen domains. To improve task completion,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · Service-Oriented Architecture and Web Services · AI in Service Interactions
