T2ISafety: Benchmark for Assessing Fairness, Toxicity, and Privacy in Image Generation

Lijun Li; Zhelun Shi; Xuhao Hu; Bowen Dong; Yiran Qin; Xihui Liu; Lu Sheng; Jing Shao

arXiv:2501.12612·cs.CL·July 28, 2025

T2ISafety: Benchmark for Assessing Fairness, Toxicity, and Privacy in Image Generation

Lijun Li, Zhelun Shi, Xuhao Hu, Bowen Dong, Yiran Qin, Xihui Liu, Lu Sheng, Jing Shao

PDF

Open Access 1 Repo 1 Datasets

TL;DR

T2ISafety is a comprehensive benchmark designed to evaluate text-to-image models across toxicity, fairness, and bias, revealing critical safety issues and aiding in the development of safer AI image generation systems.

Contribution

The paper introduces T2ISafety, a detailed safety benchmark with a large dataset and evaluator for assessing T2I models on multiple safety dimensions, addressing gaps in current evaluation methods.

Findings

01

Persistent racial fairness issues in models

02

Models tend to generate toxic content

03

Variation in privacy protection across models

Abstract

Text-to-image (T2I) models have rapidly advanced, enabling the generation of high-quality images from text prompts across various domains. However, these models present notable safety concerns, including the risk of generating harmful, biased, or private content. Current research on assessing T2I safety remains in its early stages. While some efforts have been made to evaluate models on specific safety dimensions, many critical risks remain unexplored. To address this gap, we introduce T2ISafety, a safety benchmark that evaluates T2I models across three key domains: toxicity, fairness, and bias. We build a detailed hierarchy of 12 tasks and 44 categories based on these three domains, and meticulously collect 70K corresponding prompts. Based on this taxonomy and prompt set, we build a large-scale T2I dataset with 68K manually annotated images and train an evaluator capable of detecting…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

adwardlee/t2i_safety
noneOfficial

Datasets

OpenSafetyLab/t2i_safety_dataset
dataset· 339 dl
339 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBiomedical Ethics and Regulation

MethodsDiffusion