Seeing Seeds Beyond Weeds: Green Teaming Generative AI for Beneficial Uses
Logan Stapleton, Jordan Taylor, Sarah Fox, Tongshuang Wu, Haiyi Zhu

TL;DR
This paper introduces green teaming, a method for bypassing generative AI content filters to enable beneficial applications, challenging traditional notions of harm and value in AI content moderation.
Contribution
It presents green teaming as a novel approach to design and critique generative AI systems for positive uses, demonstrated through diverse practical examples.
Findings
Green teaming enables beneficial AI use cases previously restricted by filters.
Examples include AI-assisted mental health training, debugging education, and political satire.
Green teaming serves as both a design tool and a mode of critique.
Abstract
Large generative AI models (GMs) like GPT and DALL-E are trained to generate content for general, wide-ranging purposes. GM content filters are generalized to filter out content which has a risk of harm in many cases, e.g., hate speech. However, prohibited content is not always harmful -- there are instances where generating prohibited content can be beneficial. So, when GMs filter out content, they preclude beneficial use cases along with harmful ones. Which use cases are precluded reflects the values embedded in GM content filtering. Recent work on red teaming proposes methods to bypass GM content filters to generate harmful content. We coin the term green teaming to describe methods of bypassing GM content filters to design for beneficial use cases. We showcase green teaming by: 1) Using ChatGPT as a virtual patient to simulate a person experiencing suicidal ideation, for suicide…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI · Digital Mental Health Interventions
MethodsMulti-Head Attention · Attention Is All You Need · Cosine Annealing · Linear Layer · Adam · Attention Dropout · Layer Normalization · Byte Pair Encoding · Softmax · Refunds@Expedia|||How do I get a full refund from Expedia?
