AuditLLM: A Tool for Auditing Large Language Models Using Multiprobe   Approach

Maryam Amirizaniani; Elias Martin; Tanya Roosta; Aman Chadha; Chirag; Shah

arXiv:2402.09334·cs.AI·June 19, 2024·3 cites

AuditLLM: A Tool for Auditing Large Language Models Using Multiprobe Approach

Maryam Amirizaniani, Elias Martin, Tanya Roosta, Aman Chadha, Chirag, Shah

PDF

Open Access

TL;DR

AuditLLM is a user-friendly tool that systematically probes large language models with multiple variations of a question to detect inconsistencies, bias, and hallucinations, thereby improving model reliability and safety.

Contribution

The paper introduces AuditLLM, a novel, easy-to-use auditing tool that employs a multiprobe approach to evaluate LLM consistency and identify potential issues.

Findings

01

Detects inconsistencies indicating bias or hallucinations

02

Provides interpretable results reflecting model performance

03

Supports real-time and batch auditing modes

Abstract

As Large Language Models (LLMs) are integrated into various sectors, ensuring their reliability and safety is crucial. This necessitates rigorous probing and auditing to maintain their effectiveness and trustworthiness in practical applications. Subjecting LLMs to varied iterations of a single query can unveil potential inconsistencies in their knowledge base or functional capacity. However, a tool for performing such audits with a easy to execute workflow, and low technical threshold is lacking. In this demo, we introduce ``AuditLLM,'' a novel tool designed to audit the performance of various LLMs in a methodical way. AuditLLM's primary function is to audit a given LLM by deploying multiple probes derived from a single question, thus detecting any inconsistencies in the model's comprehension or performance. A robust, reliable, and consistent LLM is expected to generate semantically…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques

MethodsBalanced Selection