A Proposal for Evaluating the Operational Risk for ChatBots based on   Large Language Models

Pedro Pinacho-Davidson; Fernando Gutierrez; Pablo Zapata; Rodolfo; Vergara; Pablo Aqueveque

arXiv:2505.04784·cs.CR·May 9, 2025

A Proposal for Evaluating the Operational Risk for ChatBots based on Large Language Models

Pedro Pinacho-Davidson, Fernando Gutierrez, Pablo Zapata, Rodolfo, Vergara, Pablo Aqueveque

PDF

Open Access

TL;DR

This paper introduces a comprehensive risk assessment metric for evaluating operational risks in chatbots powered by Large Language Models, considering technical, contextual, and stakeholder-specific factors to improve security and reliability.

Contribution

It proposes a novel, multi-dimensional risk assessment framework tailored for LLM-based chatbots, validated with an enhanced open-source vulnerability testing tool.

Findings

01

Risk scores guide mitigation strategies

02

Enhanced detection of misinformation and malicious behaviors

03

Demonstrated in retrieval-augmented generation chatbot scenarios

Abstract

The emergence of Generative AI (Gen AI) and Large Language Models (LLMs) has enabled more advanced chatbots capable of human-like interactions. However, these conversational agents introduce a broader set of operational risks that extend beyond traditional cybersecurity considerations. In this work, we propose a novel, instrumented risk-assessment metric that simultaneously evaluates potential threats to three key stakeholders: the service-providing organization, end users, and third parties. Our approach incorporates the technical complexity required to induce erroneous behaviors in the chatbot--ranging from non-induced failures to advanced prompt-injection attacks--as well as contextual factors such as the target industry, user age range, and vulnerability severity. To validate our metric, we leverage Garak, an open-source framework for LLM vulnerability testing. We further enhance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI in Service Interactions · Artificial Intelligence in Healthcare and Education · Spam and Phishing Detection

MethodsSparse Evolutionary Training