The Latent Color Subspace: Emergent Order in High-Dimensional Chaos

Mateusz Pach; Jessica Bader; Quentin Bouniot; Serge Belongie; Zeynep Akata

arXiv:2603.12261·cs.LG·March 13, 2026

The Latent Color Subspace: Emergent Order in High-Dimensional Chaos

Mateusz Pach, Jessica Bader, Quentin Bouniot, Serge Belongie, Zeynep Akata

PDF

Open Access

TL;DR

This paper uncovers a structured latent color subspace in high-dimensional autoencoder models, enabling explicit and training-free control over image color attributes in text-to-image generation.

Contribution

It introduces the Latent Color Subspace (LCS) interpretation, revealing a structured color encoding in the latent space and providing a novel, training-free color control method.

Findings

01

The LCS reflects Hue, Saturation, and Lightness in the latent space.

02

LCS can predict and control image color explicitly.

03

The method is training-free and based on closed-form latent-space manipulation.

Abstract

Text-to-image generation models have advanced rapidly, yet achieving fine-grained control over generated images remains difficult, largely due to limited understanding of how semantic information is encoded. We develop an interpretation of the color representation in the Variational Autoencoder latent space of FLUX.1 [Dev], revealing a structure reflecting Hue, Saturation, and Lightness. We verify our Latent Color Subspace (LCS) interpretation by demonstrating that it can both predict and explicitly control color, introducing a fully training-free method in FLUX based solely on closed-form latent-space manipulation. Code is available at https://github.com/ExplainableML/LCS.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Face recognition and analysis · Multimodal Machine Learning Applications