CoBELa: Steering Transparent Generation via Concept Bottlenecks on Energy Landscapes
Sangwon Kim, Kyoungoh Lee, Jeyoun Dong, Kwang-Ju Kim

TL;DR
CoBELa introduces an energy-based, decoder-free concept bottleneck framework for interpretable image generation, supporting compositional interventions and achieving improved accuracy and image quality on benchmark datasets.
Contribution
It proposes a novel energy-based approach that eliminates explicit bottleneck representations and enables post-hoc interpretability and compositional concept interventions without retraining.
Findings
Achieves 75.70%/82.42% concept accuracy on CelebA-HQ and CUB-200-2011.
Attains 6.47/5.37 FID scores, outperforming prior models.
Supports efficient multi-concept interventions through additive energy composition.
Abstract
Generative concept bottleneck models aim to enable interpretable generation by routing synthesis through explicit, user-facing concepts. In practice, prior approaches often rely on non-explicit bottleneck representations (e.g., vision cues or opaque concept embeddings) or black-box decoders to preserve image quality, which weakens the transparency. We propose CoBELa (Concept Bottlenecks on Energy Landscapes), a decoder-free, energy-based framework that eliminates non-explicit bottleneck representations by conditioning generation entirely through per-concept energy functions over the latent space of a frozen pretrained generator-requiring no generator retraining and enabling post-hoc interpretation. Because these concept energies compose additively, CoBELa naturally supports compositional concept interventions: concept conjunction and negation are realized by summing or subtracting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Generative Adversarial Networks and Image Synthesis · Machine Learning in Healthcare
