Feedback-Based Self-Learning in Large-Scale Conversational AI Agents

Pragaash Ponnusamy; Alireza Roshan Ghias; Chenlei Guo; Ruhi Sarikaya

arXiv:1911.02557·cs.LG·November 7, 2019

Feedback-Based Self-Learning in Large-Scale Conversational AI Agents

Pragaash Ponnusamy, Alireza Roshan Ghias, Chenlei Guo, Ruhi Sarikaya

PDF

Open Access

TL;DR

This paper introduces a scalable, feedback-driven self-learning system for large-scale conversational AI that reduces errors by automatically detecting issues and generating reformulations without manual annotation, improving performance in real-world deployment.

Contribution

It presents a novel self-learning framework using user feedback and Markov Chain models to automate error correction in conversational AI at scale, without manual data annotation.

Findings

01

Reduces utterance error rate by over 30% in production.

02

Achieves a win/loss ratio of 11.8 in live tests.

03

Scalable approach leveraging anonymized user data across millions.

Abstract

Today, most large-scale conversational AI agents (e.g. Alexa, Siri, or Google Assistant) are built using manually annotated data to train the different components of the system. Typically, the accuracy of the ML models in these components are improved by manually transcribing and annotating data. As the scope of these systems increase to cover more scenarios and domains, manual annotation to improve the accuracy of these components becomes prohibitively costly and time consuming. In this paper, we propose a system that leverages user-system interaction feedback signals to automate learning without any manual annotation. Users here tend to modify a previous query in hopes of fixing an error in the previous turn to get the right results. These reformulations, which are often preceded by defective experiences caused by errors in ASR, NLU, ER or the application. In some cases, users may not…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and dialogue systems · Recommender Systems and Techniques · Topic Modeling