Evaluation and Continual Improvement for an Enterprise AI Assistant

Akash V. Maharaj; Kun Qian; Uttaran Bhattacharya; Sally Fang; Horia; Galatanu; Manas Garg; Rachel Hanessian; Nishant Kapoor; Ken Russell,; Shivakumar Vaithyanathan; and Yunyao Li

arXiv:2407.12003·cs.HC·December 10, 2024

Evaluation and Continual Improvement for an Enterprise AI Assistant

Akash V. Maharaj, Kun Qian, Uttaran Bhattacharya, Sally Fang, Horia, Galatanu, Manas Garg, Rachel Hanessian, Nishant Kapoor, Ken Russell,, Shivakumar Vaithyanathan, and Yunyao Li

PDF

Open Access

TL;DR

This paper discusses the challenges in evaluating and improving enterprise AI assistants, sharing preliminary results and lessons learned to guide iterative development of conversational AI systems.

Contribution

It introduces a framework for evaluating and continually improving enterprise AI assistants, addressing specific challenges in their iterative development process.

Findings

01

Identified key challenges in evaluating enterprise AI assistants.

02

Shared preliminary results on improvement strategies.

03

Discussed lessons learned for future development.

Abstract

The development of conversational AI assistants is an iterative process with multiple components. As such, the evaluation and continual improvement of these assistants is a complex and multifaceted problem. This paper introduces the challenges in evaluating and improving a generative AI assistant for enterprises, which is under active development, and how we address these challenges. We also share preliminary results and discuss lessons learned.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital Transformation in Industry