Exploring the Representation Manifolds of Stable Diffusion Through the Lens of Intrinsic Dimension
Henry Kvinge, Davis Brown, Charles Godfrey

TL;DR
This paper investigates how prompts influence the geometric properties of internal representations in Stable Diffusion, revealing that prompt choice significantly affects the intrinsic dimension of model representations across different layers.
Contribution
It takes a first step toward a mathematical analysis of how prompts affect the intrinsic dimension of representations in Stable Diffusion, and shows that the effect depends on the layer.
Findings
Prompt choice significantly affects intrinsic dimension.
Intrinsic dimension correlates with prompt perplexity in bottleneck layers.
Layer-dependent variations in representation geometry were observed.
Abstract
Prompting has become an important mechanism by which users can more effectively interact with many flavors of foundation model. Indeed, the last several years have shown that well-honed prompts can sometimes unlock emergent capabilities within such models. While there has been a substantial amount of empirical exploration of prompting within the community, relatively few works have studied prompting at a mathematical level. In this work we aim to take a first step towards understanding basic geometric properties induced by prompts in Stable Diffusion, focusing on the intrinsic dimension of internal representations within the model. We find that choice of prompt has a substantial impact on the intrinsic dimension of representations at both layers of the model which we explored, but that the nature of this impact depends on the layer being considered. For example, in certain bottleneck…
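For readers unfamiliar with intrinsic dimension, the idea can be illustrated with a small estimator. The abstract does not say which estimator the authors use, so the sketch below swaps in the TwoNN maximum-likelihood estimator as an assumption. It is run on synthetic data (a 2-dimensional sheet embedded linearly in 10 dimensions), not on Stable Diffusion activations. The function name `twonn_intrinsic_dimension` and all variable names are hypothetical.

```python
import numpy as np


def twonn_intrinsic_dimension(points: np.ndarray) -> float:
    """Estimate intrinsic dimension from nearest-neighbor distance ratios.

    Illustrative TwoNN maximum-likelihood estimate; not necessarily the
    estimator used in the paper.
    """
    x = np.asarray(points, dtype=np.float64)

    # Pairwise squared Euclidean distances via the Gram matrix.
    sq_norms = np.sum(x * x, axis=1)
    sq_dists = sq_norms[:, None] + sq_norms[None, :] - 2.0 * (x @ x.T)
    np.maximum(sq_dists, 0.0, out=sq_dists)  # clamp tiny negative round-off
    np.fill_diagonal(sq_dists, np.inf)       # a point is not its own neighbor

    # Distances to the first and second nearest neighbors of every point.
    nearest = np.sqrt(np.partition(sq_dists, 1, axis=1)[:, :2])
    r1, r2 = nearest[:, 0], nearest[:, 1]

    # Skip exact duplicates, where the ratio r2 / r1 is undefined.
    valid = r1 > 0
    log_mu = np.log(r2[valid] / r1[valid])

    # Under the TwoNN model, r2 / r1 is Pareto distributed with shape d,
    # so the maximum-likelihood estimate is d = N / sum(log(r2 / r1)).
    return float(valid.sum() / log_mu.sum())


# Synthetic check: a 2-D sheet embedded linearly in a 10-D ambient space.
rng = np.random.default_rng(0)
latent = rng.uniform(size=(1000, 2))
embedding = rng.normal(size=(2, 10))
flat_data = latent @ embedding

estimate = twonn_intrinsic_dimension(flat_data)
print(f"ambient dimension: {flat_data.shape[1]}")
print(f"estimated intrinsic dimension: {estimate:.2f}")
```

The estimate should land near 2 rather than 10, because the estimator responds to the dimension of the surface the points occupy and not to the size of the ambient space. Applying the same function to flattened activations collected at a given layer for a fixed prompt would be one way to reproduce the kind of measurement the abstract describes, under the assumptions stated above.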
Taxonomy
Topics: Data Visualization and Analytics
Methods: Diffusion
