AuditLLM: A Tool for Auditing Large Language Models Using Multiprobe Approach
Maryam Amirizaniani, Elias Martin, Tanya Roosta, Aman Chadha, Chirag, Shah

TL;DR
AuditLLM is a user-friendly tool that systematically probes large language models with multiple variations of a question to detect inconsistencies, bias, and hallucinations, thereby improving model reliability and safety.
Contribution
The paper introduces AuditLLM, a novel, easy-to-use auditing tool that employs a multiprobe approach to evaluate LLM consistency and identify potential issues.
Findings
Detects inconsistencies indicating bias or hallucinations
Provides interpretable results reflecting model performance
Supports real-time and batch auditing modes
Abstract
As Large Language Models (LLMs) are integrated into various sectors, ensuring their reliability and safety is crucial. This necessitates rigorous probing and auditing to maintain their effectiveness and trustworthiness in practical applications. Subjecting LLMs to varied iterations of a single query can unveil potential inconsistencies in their knowledge base or functional capacity. However, a tool for performing such audits with a easy to execute workflow, and low technical threshold is lacking. In this demo, we introduce ``AuditLLM,'' a novel tool designed to audit the performance of various LLMs in a methodical way. AuditLLM's primary function is to audit a given LLM by deploying multiple probes derived from a single question, thus detecting any inconsistencies in the model's comprehension or performance. A robust, reliable, and consistent LLM is expected to generate semantically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
MethodsBalanced Selection
