A Proposal for Evaluating the Operational Risk for ChatBots based on Large Language Models
Pedro Pinacho-Davidson, Fernando Gutierrez, Pablo Zapata, Rodolfo, Vergara, Pablo Aqueveque

TL;DR
This paper introduces a comprehensive risk assessment metric for evaluating operational risks in chatbots powered by Large Language Models, considering technical, contextual, and stakeholder-specific factors to improve security and reliability.
Contribution
It proposes a novel, multi-dimensional risk assessment framework tailored for LLM-based chatbots, validated with an enhanced open-source vulnerability testing tool.
Findings
Risk scores guide mitigation strategies
Enhanced detection of misinformation and malicious behaviors
Demonstrated in retrieval-augmented generation chatbot scenarios
Abstract
The emergence of Generative AI (Gen AI) and Large Language Models (LLMs) has enabled more advanced chatbots capable of human-like interactions. However, these conversational agents introduce a broader set of operational risks that extend beyond traditional cybersecurity considerations. In this work, we propose a novel, instrumented risk-assessment metric that simultaneously evaluates potential threats to three key stakeholders: the service-providing organization, end users, and third parties. Our approach incorporates the technical complexity required to induce erroneous behaviors in the chatbot--ranging from non-induced failures to advanced prompt-injection attacks--as well as contextual factors such as the target industry, user age range, and vulnerability severity. To validate our metric, we leverage Garak, an open-source framework for LLM vulnerability testing. We further enhance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in Service Interactions · Artificial Intelligence in Healthcare and Education · Spam and Phishing Detection
MethodsSparse Evolutionary Training
