Seeing Seeds Beyond Weeds: Green Teaming Generative AI for Beneficial   Uses

Logan Stapleton; Jordan Taylor; Sarah Fox; Tongshuang Wu; Haiyi Zhu

arXiv:2306.03097·cs.HC·June 7, 2023·5 cites

Seeing Seeds Beyond Weeds: Green Teaming Generative AI for Beneficial Uses

Logan Stapleton, Jordan Taylor, Sarah Fox, Tongshuang Wu, Haiyi Zhu

PDF

Open Access

TL;DR

This paper introduces green teaming, a method for bypassing generative AI content filters to enable beneficial applications, challenging traditional notions of harm and value in AI content moderation.

Contribution

It presents green teaming as a novel approach to design and critique generative AI systems for positive uses, demonstrated through diverse practical examples.

Findings

01

Green teaming enables beneficial AI use cases previously restricted by filters.

02

Examples include AI-assisted mental health training, debugging education, and political satire.

03

Green teaming serves as both a design tool and a mode of critique.

Abstract

Large generative AI models (GMs) like GPT and DALL-E are trained to generate content for general, wide-ranging purposes. GM content filters are generalized to filter out content which has a risk of harm in many cases, e.g., hate speech. However, prohibited content is not always harmful -- there are instances where generating prohibited content can be beneficial. So, when GMs filter out content, they preclude beneficial use cases along with harmful ones. Which use cases are precluded reflects the values embedded in GM content filtering. Recent work on red teaming proposes methods to bypass GM content filters to generate harmful content. We coin the term green teaming to describe methods of bypassing GM content filters to design for beneficial use cases. We showcase green teaming by: 1) Using ChatGPT as a virtual patient to simulate a person experiencing suicidal ideation, for suicide…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEthics and Social Impacts of AI · Digital Mental Health Interventions

MethodsMulti-Head Attention · Attention Is All You Need · Cosine Annealing · Linear Layer · Adam · Attention Dropout · Layer Normalization · Byte Pair Encoding · Softmax · Refunds@Expedia|||How do I get a full refund from Expedia?