AI red-teaming is a sociotechnical problem: on values, labor, and harms
Tarleton Gillespie, Ryland Shaw, Mary L. Gray, Jina Suh

TL;DR
This paper emphasizes the sociotechnical aspects of AI red-teaming, advocating for interdisciplinary research to understand its values, labor, and societal impacts to improve safety practices.
Contribution
It highlights the need for collaboration between computer and social scientists to study red-teaming's social and ethical dimensions in AI safety.
Findings
Red-teaming involves complex sociotechnical systems.
Labor and psychological impacts are significant in red-teaming work.
Understanding values behind red-teaming can improve safety practices.
Abstract
As generative AI technologies find more and more real-world applications, the importance of testing their performance and safety seems paramount. "Red-teaming" has quickly become the primary approach to test AI models--prioritized by AI companies, and enshrined in AI policy and regulation. Members of red teams act as adversaries, probing AI systems to test their safety mechanisms and uncover vulnerabilities. Yet we know far too little about this work or its implications. This essay calls for collaboration between computer scientists and social scientists to study the sociotechnical systems surrounding AI technologies, including the work of red-teaming, to avoid repeating the mistakes of the recent past. We highlight the importance of understanding the values and assumptions behind red-teaming, the labor arrangements involved, and the psychological impacts on red-teamers, drawing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
