DICES Dataset: Diversity in Conversational AI Evaluation for Safety
Lora Aroyo, Alex S. Taylor, Mark Diaz, Christopher M. Homan, Alicia, Parrish, Greg Serapio-Garcia, Vinodkumar Prabhakaran, Ding Wang

TL;DR
The DICES dataset provides a diverse, demographically annotated resource for evaluating safety in conversational AI, emphasizing the importance of capturing subjectivity and cultural variance in safety assessments.
Contribution
This paper introduces the DICES dataset, enabling detailed analysis of demographic influences and diversity in conversational AI safety evaluations.
Findings
DICES dataset includes demographic annotations and multiple ratings per item.
Allows analysis of safety ratings across different demographic groups.
Facilitates development of more inclusive safety metrics.
Abstract
Machine learning approaches often require training and evaluation datasets with a clear separation between positive and negative examples. This risks simplifying and even obscuring the inherent subjectivity present in many tasks. Preserving such variance in content and diversity in datasets is often expensive and laborious. This is especially troubling when building safety datasets for conversational AI systems, as safety is both socially and culturally situated. To demonstrate this crucial aspect of conversational AI safety, and to facilitate in-depth model performance analyses, we introduce the DICES (Diversity In Conversational AI Evaluation for Safety) dataset that contains fine-grained demographic information about raters, high replication of ratings per item to ensure statistical power for analyses, and encodes rater votes as distributions across different demographics to allow…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗ibm-granite/granite-guardian-3.2-5bmodel· 33k dl· ♡ 1233k dl♡ 12
- 🤗ibm-granite/granite-guardian-3.2-3b-a800mmodel· 4.2k dl· ♡ 84.2k dl♡ 8
- 🤗cgus/granite-guardian-3.2-5b-exl2model· 1 dl1 dl
- 🤗RichardErkhov/ibm-granite_-_granite-guardian-3.2-3b-a800m-4bitsmodel· 1 dl1 dl
- 🤗RichardErkhov/ibm-granite_-_granite-guardian-3.2-3b-a800m-8bitsmodel· 4 dl4 dl
- 🤗Mungert/granite-guardian-3.2-3b-a800m-GGUFmodel· 87 dl· ♡ 187 dl♡ 1
- 🤗Mungert/granite-guardian-3.2-5b-GGUFmodel· 57 dl· ♡ 157 dl♡ 1
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational and Text Analysis Methods
