SCALEX: Scalable Concept and Latent Exploration for Diffusion Models
E. Zhixuan Zeng, Yuhao Chen, Alexander Wong

TL;DR
SCALEX is a scalable, automated framework that explores diffusion model latent spaces using natural language prompts, enabling bias detection and semantic analysis without retraining or manual labeling.
Contribution
It introduces SCALEX, a novel method for zero-shot, large-scale semantic exploration of diffusion models' latent spaces using natural language prompts.
Findings
Detects gender bias in profession prompts
Ranks semantic alignment of identity descriptors
Reveals clustered conceptual structures
Abstract
Image generation models frequently encode social biases, including stereotypes tied to gender, race, and profession. Existing methods for analyzing these biases in diffusion models either focus narrowly on predefined categories or depend on manual interpretation of latent directions. These constraints limit scalability and hinder the discovery of subtle or unanticipated patterns. We introduce SCALEX, a framework for scalable and automated exploration of diffusion model latent spaces. SCALEX extracts semantically meaningful directions from H-space using only natural language prompts, enabling zero-shot interpretation without retraining or labelling. This allows systematic comparison across arbitrary concepts and large-scale discovery of internal model associations. We show that SCALEX detects gender bias in profession prompts, ranks semantic alignment across identity descriptors, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Computational and Text Analysis Methods · Multimodal Machine Learning Applications
