Uncertainty Measures in Neural Belief Tracking and the Effects on Dialogue Policy Performance
Carel van Niekerk, Andrey Malinin, Christian Geishauser, Michael Heck,, Hsien-chin Lin, Nurul Lubis, Shutong Feng, Milica Ga\v{s}i\'c

TL;DR
This paper explores how incorporating various uncertainty measures into neural belief tracking improves the robustness and performance of dialogue policies, addressing a gap in current neural dialogue systems that often ignore uncertainty.
Contribution
It introduces different uncertainty measures into neural belief tracking and evaluates their impact on downstream dialogue policy performance using both simulated and real user interactions.
Findings
Uncertainty measures improve dialogue policy robustness.
Incorporating uncertainty enhances policy performance.
Neural belief trackers benefit from explicit uncertainty modeling.
Abstract
The ability to identify and resolve uncertainty is crucial for the robustness of a dialogue system. Indeed, this has been confirmed empirically on systems that utilise Bayesian approaches to dialogue belief tracking. However, such systems consider only confidence estimates and have difficulty scaling to more complex settings. Neural dialogue systems, on the other hand, rarely take uncertainties into account. They are therefore overconfident in their decisions and less robust. Moreover, the performance of the tracking task is often evaluated in isolation, without consideration of its effect on the downstream policy optimisation. We propose the use of different uncertainty measures in neural belief tracking. The effects of these measures on the downstream task of policy optimisation are evaluated by adding selected measures of uncertainty to the feature space of the policy and training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · Topic Modeling · Intelligent Tutoring Systems and Adaptive Learning
