Insights on Disagreement Patterns in Multimodal Safety Perception across   Diverse Rater Groups

Charvi Rastogi; Tian Huey Teh; Pushkar Mishra; Roma Patel; Zoe; Ashwood; Aida Mostafazadeh Davani; Mark Diaz; Michela Paganini; Alicia; Parrish; Ding Wang; Vinodkumar Prabhakaran; Lora Aroyo; Verena Rieser

arXiv:2410.17032·cs.AI·October 23, 2024

Insights on Disagreement Patterns in Multimodal Safety Perception across Diverse Rater Groups

Charvi Rastogi, Tian Huey Teh, Pushkar Mishra, Roma Patel, Zoe, Ashwood, Aida Mostafazadeh Davani, Mark Diaz, Michela Paganini, Alicia, Parrish, Ding Wang, Vinodkumar Prabhakaran, Lora Aroyo, Verena Rieser

PDF

Open Access

TL;DR

This study reveals significant demographic differences in safety perceptions of generative AI, emphasizing the importance of diverse perspectives for inclusive safety evaluation.

Contribution

It provides a large-scale analysis of demographic variations in safety ratings for text-to-image AI, highlighting differences from expert assessments and previous text safety studies.

Findings

01

Significant differences in harm severity assessments across demographic groups.

02

Diverse raters produce annotation patterns different from experts.

03

Differences in safety perceptions are distinct from text-based safety group differences.

Abstract

AI systems crucially rely on human ratings, but these ratings are often aggregated, obscuring the inherent diversity of perspectives in real-world phenomenon. This is particularly concerning when evaluating the safety of generative AI, where perceptions and associated harms can vary significantly across socio-cultural contexts. While recent research has studied the impact of demographic differences on annotating text, there is limited understanding of how these subjective variations affect multimodal safety in generative AI. To address this, we conduct a large-scale study employing highly-parallel safety ratings of about 1000 text-to-image (T2I) generations from a demographically diverse rater pool of 630 raters balanced across 30 intersectional groups across age, gender, and ethnicity. Our study shows that (1) there are significant differences across demographic groups (including…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRisk Perception and Management

MethodsSparse Evolutionary Training