Evaluation and Continual Improvement for an Enterprise AI Assistant
Akash V. Maharaj, Kun Qian, Uttaran Bhattacharya, Sally Fang, Horia, Galatanu, Manas Garg, Rachel Hanessian, Nishant Kapoor, Ken Russell,, Shivakumar Vaithyanathan, and Yunyao Li

TL;DR
This paper discusses the challenges in evaluating and improving enterprise AI assistants, sharing preliminary results and lessons learned to guide iterative development of conversational AI systems.
Contribution
It introduces a framework for evaluating and continually improving enterprise AI assistants, addressing specific challenges in their iterative development process.
Findings
Identified key challenges in evaluating enterprise AI assistants.
Shared preliminary results on improvement strategies.
Discussed lessons learned for future development.
Abstract
The development of conversational AI assistants is an iterative process with multiple components. As such, the evaluation and continual improvement of these assistants is a complex and multifaceted problem. This paper introduces the challenges in evaluating and improving a generative AI assistant for enterprises, which is under active development, and how we address these challenges. We also share preliminary results and discuss lessons learned.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Transformation in Industry
