AI red-teaming is a sociotechnical problem: on values, labor, and harms

Tarleton Gillespie; Ryland Shaw; Mary L. Gray; Jina Suh

arXiv:2412.09751·cs.CY·January 9, 2026

AI red-teaming is a sociotechnical problem: on values, labor, and harms

Tarleton Gillespie, Ryland Shaw, Mary L. Gray, Jina Suh

PDF

TL;DR

This paper emphasizes the sociotechnical aspects of AI red-teaming, advocating for interdisciplinary research to understand its values, labor, and societal impacts to improve safety practices.

Contribution

It highlights the need for collaboration between computer and social scientists to study red-teaming's social and ethical dimensions in AI safety.

Findings

01

Red-teaming involves complex sociotechnical systems.

02

Labor and psychological impacts are significant in red-teaming work.

03

Understanding values behind red-teaming can improve safety practices.

Abstract

As generative AI technologies find more and more real-world applications, the importance of testing their performance and safety seems paramount. "Red-teaming" has quickly become the primary approach to test AI models--prioritized by AI companies, and enshrined in AI policy and regulation. Members of red teams act as adversaries, probing AI systems to test their safety mechanisms and uncover vulnerabilities. Yet we know far too little about this work or its implications. This essay calls for collaboration between computer scientists and social scientists to study the sociotechnical systems surrounding AI technologies, including the work of red-teaming, to avoid repeating the mistakes of the recent past. We highlight the importance of understanding the values and assumptions behind red-teaming, the labor arrangements involved, and the psychological impacts on red-teamers, drawing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.