SCOPE: A Dataset of Stereotyped Prompts for Counterfactual Fairness Assessment of LLMs
Alessandra Parziale, Gianmario Voria, Valeria Pontillo, Andrea De Lucia, Gemma Catolino, Fabio Palomba

TL;DR
SCOPE is a large, diverse dataset of counterfactual prompts designed to systematically evaluate fairness and bias in large language models across multiple topics, bias dimensions, and communicative intents.
Contribution
The paper introduces SCOPE, a comprehensive dataset of 241,280 prompts with 120,640 counterfactual pairs, enabling detailed analysis of demographic bias and fairness in LLMs.
Findings
SCOPE covers 1,438 topics and 1,536 demographic groups.
Prompts are generated under four communicative intents for broad interaction coverage.
The dataset facilitates systematic evaluation of bias, robustness, and counterfactual consistency in LLMs.
Abstract
Large Language Models (LLMs) now serve as the foundation for a wide range of applications, from conversational assistants to decision support tools, making the issue of fairness in their results increasingly important. Previous studies have shown that LLM outputs can shift when prompts reference different demographic groups, even when intent and semantic content remain constant. However, existing resources for probing such disparities rely primarily on small, template-based counterfactual examples or fixed sentence pairs. These benchmarks offer limited linguistic diversity, narrow topical coverage, and little support for analyzing how communicative intent affects model behavior. To address these limitations, we introduce SCOPE (Stereotype-COnditioned Prompts for Evaluation), a large-scale dataset of counterfactual prompt pairs designed to enable systematic investigation of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
