Ask What Your Country Can Do For You: Towards a Public Red Teaming Model

Wm. Matthew Kennedy; Cigdem Patlak; Jayraj Dave; Blake Chambers; Aayush Dhanotiya; Darshini Ramiah; Reva Schwartz; Jack Hagen; Akash Kundu; Mouni Pendharkar; Liam Baisley; Theodora Skeadas; Rumman Chowdhury

arXiv:2510.20061·cs.CY·October 24, 2025

Ask What Your Country Can Do For You: Towards a Public Red Teaming Model

Wm. Matthew Kennedy, Cigdem Patlak, Jayraj Dave, Blake Chambers, Aayush Dhanotiya, Darshini Ramiah, Reva Schwartz, Jack Hagen, Akash Kundu, Mouni Pendharkar, Liam Baisley, Theodora Skeadas, Rumman Chowdhury

PDF

Open Access

TL;DR

This paper proposes a public red-teaming model for AI systems, aiming to improve oversight and responsibility by involving the public in adversarial evaluation, with early pilot results demonstrating its potential effectiveness and scalability.

Contribution

It introduces a novel cooperative public AI red-teaming approach, integrating it with live demonstrations and pilot exercises to enhance AI risk assessment and oversight.

Findings

01

Early pilot exercises show promising results in identifying AI risks.

02

The approach is scalable to various jurisdictions and sectors.

03

Public involvement enhances AI safety evaluation.

Abstract

AI systems have the potential to produce both benefits and harms, but without rigorous and ongoing adversarial evaluation, AI actors will struggle to assess the breadth and magnitude of the AI risk surface. Researchers from the field of systems design have developed several effective sociotechnical AI evaluation and red teaming techniques targeting bias, hate speech, mis/disinformation, and other documented harm classes. However, as increasingly sophisticated AI systems are released into high-stakes sectors (such as education, healthcare, and intelligence-gathering), our current evaluation and monitoring methods are proving less and less capable of delivering effective oversight. In order to actually deliver responsible AI and to ensure AI's harms are fully understood and its security vulnerabilities mitigated, pioneering new approaches to close this "responsibility gap" are now more…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEthics and Social Impacts of AI · Misinformation and Its Impacts · Spam and Phishing Detection