LUCID-SAE: Learning Unified Vision-Language Sparse Codes for Interpretable Concept Discovery

Difei Gu; Yunhe Gao; Gerasimos Chatzoudis; Zihan Dong; Guoning Zhang; Bangwei Guo; Yang Zhou; Mu Zhou; Dimitris Metaxas

arXiv:2602.07311·cs.CV·February 10, 2026

LUCID-SAE: Learning Unified Vision-Language Sparse Codes for Interpretable Concept Discovery

Difei Gu, Yunhe Gao, Gerasimos Chatzoudis, Zihan Dong, Guoning Zhang, Bangwei Guo, Yang Zhou, Mu Zhou, Dimitris Metaxas

PDF

Open Access

TL;DR

LUCID introduces a unified vision-language autoencoder that learns shared and private features for images and text, enabling interpretable, cross-modal concept discovery without labels.

Contribution

It proposes a novel shared latent dictionary for vision and language, with an alignment method that improves interpretability and transferability of features across modalities.

Findings

01

Shared features support patch-level grounding

02

Establish cross-modal neuron correspondence

03

Capture diverse semantic categories beyond objects

Abstract

Sparse autoencoders (SAEs) offer a natural path toward comparable explanations across different representation spaces. However, current SAEs are trained per modality, producing dictionaries whose features are not directly understandable and whose explanations do not transfer across domains. In this study, we introduce LUCID (Learning Unified vision-language sparse Codes for Interpretable concept Discovery), a unified vision-language sparse autoencoder that learns a shared latent dictionary for image patch and text token representations, while reserving private capacity for modality-specific details. We achieve feature alignment by coupling the shared codes with a learned optimal transport matching objective without the need of labeling. LUCID yields interpretable shared features that support patch-level grounding, establish cross-modal neuron correspondence, and enhance robustness…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Multimodal Machine Learning Applications · Generative Adversarial Networks and Image Synthesis