Investigating Disability Representations in Text-to-Image Models
Yang Tian, Yu Fan, Liudmila Zavolokina, Sarah Ebling

TL;DR
This paper examines how current text-to-image models depict disabilities, revealing biases and imbalances, and evaluates strategies to improve inclusive representations through analysis and human assessments.
Contribution
It provides a systematic analysis of disability representations in AI-generated images and assesses mitigation strategies to promote diversity and inclusivity.
Findings
Persistent biases in disability portrayals
Mitigation strategies can influence affective framing
Need for ongoing evaluation of model inclusivity
Abstract
Text-to-image generative models have made remarkable progress in producing high-quality visual content from textual descriptions, yet concerns remain about how they represent social groups. While characteristics like gender and race have received increasing attention, disability representations remain underexplored. This study investigates how people with disabilities are represented in AI-generated images by analyzing outputs from Stable Diffusion XL and DALL-E 3 using a structured prompt design. We analyze disability representations by comparing image similarities between generic disability prompts and prompts referring to specific disability categories. Moreover, we evaluate how mitigation strategies influence disability portrayals, with a focus on assessing affective framing through sentiment polarity analysis, combining both automatic and human evaluation. Our findings reveal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDisability Rights and Representation · Multimodal Machine Learning Applications · Text Readability and Simplification
