Aligned but Stereotypical? The Hidden Influence of System Prompts on Social Bias in LVLM-Based Text-to-Image Models

NaHyeon Park; Namin An; Kunhee Kim; Soyeon Yoon; Jiahao Huo; Hyunjung Shim

arXiv:2512.04981·cs.CV·December 5, 2025

Aligned but Stereotypical? The Hidden Influence of System Prompts on Social Bias in LVLM-Based Text-to-Image Models

NaHyeon Park, Namin An, Kunhee Kim, Soyeon Yoon, Jiahao Huo, Hyunjung Shim

PDF

Open Access 1 Datasets

TL;DR

This paper investigates how system prompts influence social bias in LVLM-based text-to-image models, revealing prompts as a key factor in bias propagation and proposing a framework to mitigate this bias without retraining.

Contribution

It identifies system prompts as a primary source of bias in LVLM-based T2I models and introduces FairPro, a framework to reduce bias through self-auditing prompts at test time.

Findings

01

LVLM-based models produce more social bias than non-LVLM models.

02

System prompts encode demographic priors that influence bias in generated images.

03

FairPro significantly reduces demographic bias while maintaining image quality.

Abstract

Large vision-language model (LVLM) based text-to-image (T2I) systems have become the dominant paradigm in image generation, yet whether they amplify social biases remains insufficiently understood. In this paper, we show that LVLM-based models produce markedly more socially biased images than non-LVLM-based models. We introduce a 1,024 prompt benchmark spanning four levels of linguistic complexity and evaluate demographic bias across multiple attributes in a systematic manner. Our analysis identifies system prompts, the predefined instructions guiding LVLMs, as a primary driver of biased behavior. Through decoded intermediate representations, token-probability diagnostics, and embedding-association analyses, we reveal how system prompts encode demographic priors that propagate into image synthesis. To this end, we propose FairPro, a training-free meta-prompting framework that enables…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

nahyeonkaty/fairpro
dataset· 4 dl
4 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis