Discovering Fine-Grained Visual-Concept Relations by Disentangled Optimal Transport Concept Bottleneck Models

Yan Xie; Zequn Zeng; Hao Zhang; Yucheng Ding; Yi Wang; Zhengjue Wang; Bo Chen; Hongwei Liu

arXiv:2505.07209·cs.CV·May 13, 2025

Discovering Fine-Grained Visual-Concept Relations by Disentangled Optimal Transport Concept Bottleneck Models

Yan Xie, Zequn Zeng, Hao Zhang, Yucheng Ding, Yi Wang, Zhengjue Wang, Bo Chen, Hongwei Liu

PDF

Open Access

TL;DR

This paper introduces DOT-CBM, a novel framework that models fine-grained visual-concept relations using optimal transport, improving interpretability and accuracy in concept-based image classification.

Contribution

The paper proposes a disentangled optimal transport approach for CBMs, enabling explicit local feature alignment and addressing biases for more reliable and interpretable predictions.

Findings

01

Achieves state-of-the-art performance on image classification tasks.

02

Provides reliable visualization of concept contributions through heatmaps.

03

Enhances out-of-distribution generalization and local part detection.

Abstract

Concept Bottleneck Models (CBMs) try to make the decision-making process transparent by exploring an intermediate concept space between the input image and the output prediction. Existing CBMs just learn coarse-grained relations between the whole image and the concepts, less considering local image information, leading to two main drawbacks: i) they often produce spurious visual-concept relations, hence decreasing model reliability; and ii) though CBMs could explain the importance of every concept to the final prediction, it is still challenging to tell which visual region produces the prediction. To solve these problems, this paper proposes a Disentangled Optimal Transport CBM (DOT-CBM) framework to explore fine-grained visual-concept relations between local image patches and concepts. Specifically, we model the concept prediction process as a transportation problem between the patches…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis · Explainable Artificial Intelligence (XAI)