Supporting Human-AI Collaboration in Auditing LLMs with LLMs
Charvi Rastogi, Marco Tulio Ribeiro, Nicholas King, Harsha Nori,, Saleema Amershi

TL;DR
This paper introduces AdaTest++, an enhanced human-AI collaborative auditing tool for large language models, emphasizing sensemaking and communication to improve failure detection and understanding.
Contribution
It advances LLM auditing by integrating human-AI collaboration principles into AdaTest, demonstrating improved failure detection and reporting capabilities.
Findings
AdaTest++ effectively leverages human strengths in sensemaking and hypothesis testing.
Participants identified diverse failure modes, including under-reported issues.
The tool improves detection of biases and errors in commercial LLMs.
Abstract
Large language models are becoming increasingly pervasive and ubiquitous in society via deployment in sociotechnical systems. Yet these language models, be it for classification or generation, have been shown to be biased and behave irresponsibly, causing harm to people at scale. It is crucial to audit these language models rigorously. Existing auditing tools leverage either or both humans and AI to find failures. In this work, we draw upon literature in human-AI collaboration and sensemaking, and conduct interviews with research experts in safe and fair AI, to build upon the auditing tool: AdaTest (Ribeiro and Lundberg, 2022), which is powered by a generative large language model (LLM). Through the design process we highlight the importance of sensemaking and human-AI communication to leverage complementary strengths of humans and generative models in collaborative auditing. To…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI · Explainable Artificial Intelligence (XAI) · Artificial Intelligence in Healthcare and Education
MethodsMulti-Head Attention · Attention Is All You Need · Cosine Annealing · Linear Layer · 15 Ways to Contact How can i speak to someone at Delta Airlines · Dense Connections · {Dispute@FaQ-s}How to file a dispute with Expedia? · Adam · Attention Dropout · Linear Warmup With Cosine Annealing
