SCALEX: Scalable Concept and Latent Exploration for Diffusion Models

E. Zhixuan Zeng; Yuhao Chen; Alexander Wong

arXiv:2511.13750·cs.LG·November 24, 2025

SCALEX: Scalable Concept and Latent Exploration for Diffusion Models

E. Zhixuan Zeng, Yuhao Chen, Alexander Wong

PDF

Open Access

TL;DR

SCALEX is a scalable, automated framework that explores diffusion model latent spaces using natural language prompts, enabling bias detection and semantic analysis without retraining or manual labeling.

Contribution

It introduces SCALEX, a novel method for zero-shot, large-scale semantic exploration of diffusion models' latent spaces using natural language prompts.

Findings

01

Detects gender bias in profession prompts

02

Ranks semantic alignment of identity descriptors

03

Reveals clustered conceptual structures

Abstract

Image generation models frequently encode social biases, including stereotypes tied to gender, race, and profession. Existing methods for analyzing these biases in diffusion models either focus narrowly on predefined categories or depend on manual interpretation of latent directions. These constraints limit scalability and hinder the discovery of subtle or unanticipated patterns. We introduce SCALEX, a framework for scalable and automated exploration of diffusion model latent spaces. SCALEX extracts semantically meaningful directions from H-space using only natural language prompts, enabling zero-shot interpretation without retraining or labelling. This allows systematic comparison across arbitrary concepts and large-scale discovery of internal model associations. We show that SCALEX detects gender bias in profession prompts, ranks semantic alignment across identity descriptors, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Computational and Text Analysis Methods · Multimodal Machine Learning Applications