Evaluating Chatbots to Promote Users' Trust -- Practices and Open   Problems

Biplav Srivastava; Kausik Lakkaraju; Tarmo Koppel; Vignesh Narayanan,; Ashish Kundu; Sachindra Joshi

arXiv:2309.05680·cs.HC·September 15, 2023·1 cites

Evaluating Chatbots to Promote Users' Trust -- Practices and Open Problems

Biplav Srivastava, Kausik Lakkaraju, Tarmo Koppel, Vignesh Narayanan,, Ashish Kundu, Sachindra Joshi

PDF

Open Access

TL;DR

This paper reviews current chatbot testing practices, highlights open problems affecting user trust, and suggests future directions to improve trustworthiness and societal impact of AI chatbots.

Contribution

It provides a comprehensive review of chatbot testing practices, identifies gaps, and outlines open problems to enhance user trust in AI chatbots.

Findings

01

Current testing practices are insufficient for ensuring trust.

02

Open problems include addressing societal and long-term impacts.

03

Recommendations for future research in chatbot trust enhancement.

Abstract

Chatbots, the common moniker for collaborative assistants, are Artificial Intelligence (AI) software that enables people to naturally interact with them to get tasks done. Although chatbots have been studied since the dawn of AI, they have particularly caught the imagination of the public and businesses since the launch of easy-to-use and general-purpose Large Language Model-based chatbots like ChatGPT. As businesses look towards chatbots as a potential technology to engage users, who may be end customers, suppliers, or even their own employees, proper testing of chatbots is important to address and mitigate issues of trust related to service or product performance, user satisfaction and long-term unintended consequences for society. This paper reviews current practices for chatbot testing, identifies gaps as open problems in pursuit of user trust, and outlines a path forward.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI in Service Interactions · Ethics and Social Impacts of AI

Methodstravel james