SeeGULL: A Stereotype Benchmark with Broad Geo-Cultural Coverage Leveraging Generative Models
Akshita Jha, Aida Davani, Chandan K. Reddy, Shachi Dave, Vinodkumar Prabhakaran, Sunipa Dev

TL;DR
SeeGULL is a comprehensive, globally diverse stereotype dataset created using large language models and diverse raters, addressing the Western-centric bias of existing datasets in NLP.
Contribution
The paper introduces SeeGULL, a broad-coverage stereotype benchmark with global representation, generated with LLMs and validated by diverse raters, filling a major gap in stereotype datasets.
Findings
SeeGULL covers stereotypes from 178 countries across 8 regions.
Global disparities in stereotypes are demonstrated through offensiveness scores.
Regional differences in stereotypes are identified between local and North American annotators.
Abstract
Stereotype benchmark datasets are crucial to detect and mitigate social stereotypes about groups of people in NLP models. However, existing datasets are limited in size and coverage, and are largely restricted to stereotypes prevalent in Western society. This is especially problematic as language technologies gain hold across the globe. To address this gap, we present SeeGULL, a broad-coverage stereotype dataset, built by utilizing the generative capabilities of large language models such as PaLM and GPT-3, and leveraging a globally diverse rater pool to validate the prevalence of those stereotypes in society. SeeGULL is in English, and contains stereotypes about identity groups spanning 178 countries across 8 different geo-political regions across 6 continents, as well as state-level identities within the US and India. We also include fine-grained offensiveness scores for different…
Taxonomy
Topics: Hate Speech and Cyberbullying Detection
Methods: Multi-Head Attention · Attention Is All You Need · Cosine Annealing · Dropout · Linear Layer · Dense Connections · Attention Dropout · Adam · Residual Connection
