Contextual Bandit Applications in Customer Support Bot
Sandra Sajeev, Jade Huang, Nikos Karampatziakis, Matthew Hall,, Sebastian Kochman, and Weizhu Chen

TL;DR
This paper explores the deployment of contextual bandit algorithms in Microsoft's customer support virtual agent, demonstrating significant improvements in resolution rates and reduction in escalations through real-world implementations.
Contribution
It presents practical implementations of neural-linear and multi-armed bandit algorithms for intent disambiguation and recommendations in a production support bot, validated by A/B testing.
Findings
Over 12% increase in problem resolution rate
Over 4% decrease in escalations to human operators
Successful deployment of bandit algorithms in real-world customer support
Abstract
Virtual support agents have grown in popularity as a way for businesses to provide better and more accessible customer service. Some challenges in this domain include ambiguous user queries as well as changing support topics and user behavior (non-stationarity). We do, however, have access to partial feedback provided by the user (clicks, surveys, and other events) which can be leveraged to improve the user experience. Adaptable learning techniques, like contextual bandits, are a natural fit for this problem setting. In this paper, we discuss real-world implementations of contextual bandits (CB) for the Microsoft virtual agent. It includes intent disambiguation based on neural-linear bandits (NLB) and contextual recommendations based on a collection of multi-armed bandits (MAB). Our solutions have been deployed to production and have improved key business metrics of the Microsoft…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methodstravel james
