Know Your Mistakes: Towards Preventing Overreliance on Task-Oriented Conversational AI Through Accountability Modeling
Suvodip Dey, Yi-Jyun Sun, Gokhan Tur, Dilek Hakkani-Tur

TL;DR
This paper introduces an accountability model with an additional classifier head for task-oriented dialogue LLMs to detect errors, improve accuracy, and reduce user overreliance through friction turns and self-correction.
Contribution
It proposes a novel accountability model with an added classifier head for error detection and correction in dialogue systems, enhancing accuracy and user trust.
Findings
3% improvement in joint goal accuracy (JGA) with accountability heads
State-of-the-art DST performance achieved through self-correction
Error correction via user confirmations reduces overreliance
Abstract
Recent LLMs have enabled significant advancements for conversational agents. However, they are also well known to hallucinate, producing responses that seem plausible but are factually incorrect. On the other hand, users tend to over-rely on LLM-based AI agents, accepting AI's suggestion even when it is wrong. Adding positive friction, such as explanations or getting user confirmations, has been proposed as a mitigation in AI-supported decision-making systems. In this paper, we propose an accountability model for LLM-based task-oriented dialogue agents to address user overreliance via friction turns in cases of model uncertainty and errors associated with dialogue state tracking (DST). The accountability model is an augmented LLM with an additional accountability head that functions as a binary classifier to predict the relevant slots of the dialogue state mentioned in the conversation.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Ethics and Social Impacts of AI
MethodsDynamic Sparse Training
