Insights on Disagreement Patterns in Multimodal Safety Perception across Diverse Rater Groups
Charvi Rastogi, Tian Huey Teh, Pushkar Mishra, Roma Patel, Zoe, Ashwood, Aida Mostafazadeh Davani, Mark Diaz, Michela Paganini, Alicia, Parrish, Ding Wang, Vinodkumar Prabhakaran, Lora Aroyo, Verena Rieser

TL;DR
This study reveals significant demographic differences in safety perceptions of generative AI, emphasizing the importance of diverse perspectives for inclusive safety evaluation.
Contribution
It provides a large-scale analysis of demographic variations in safety ratings for text-to-image AI, highlighting differences from expert assessments and previous text safety studies.
Findings
Significant differences in harm severity assessments across demographic groups.
Diverse raters produce annotation patterns different from experts.
Differences in safety perceptions are distinct from text-based safety group differences.
Abstract
AI systems crucially rely on human ratings, but these ratings are often aggregated, obscuring the inherent diversity of perspectives in real-world phenomenon. This is particularly concerning when evaluating the safety of generative AI, where perceptions and associated harms can vary significantly across socio-cultural contexts. While recent research has studied the impact of demographic differences on annotating text, there is limited understanding of how these subjective variations affect multimodal safety in generative AI. To address this, we conduct a large-scale study employing highly-parallel safety ratings of about 1000 text-to-image (T2I) generations from a demographically diverse rater pool of 630 raters balanced across 30 intersectional groups across age, gender, and ethnicity. Our study shows that (1) there are significant differences across demographic groups (including…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRisk Perception and Management
MethodsSparse Evolutionary Training
