Explain via Any Concept: Concept Bottleneck Model with Open Vocabulary Concepts
Andong Tan, Fengtao Zhou, Hao Chen

TL;DR
This paper introduces OpenCBM, an innovative concept bottleneck model that incorporates open vocabulary concepts using vision-language pre-trained models, enabling flexible, post-training explanations and significantly improving classification accuracy.
Contribution
OpenCBM is the first CBM to support open vocabulary concepts, allowing dynamic addition, removal, or replacement of concepts for interpretability after training.
Findings
Outperforms previous CBMs by 9% accuracy on CUB-200-2011
Enables explanation with any user-defined concepts post-training
Aligns image features with CLIP for open vocabulary concept reconstruction
Abstract
The concept bottleneck model (CBM) is an interpretable-by-design framework that makes decisions by first predicting a set of interpretable concepts, and then predicting the class label based on the given concepts. Existing CBMs are trained with a fixed set of concepts (concepts are either annotated by the dataset or queried from language models). However, this closed-world assumption is unrealistic in practice, as users may wonder about the role of any desired concept in decision-making after the model is deployed. Inspired by the large success of recent vision-language pre-trained models such as CLIP in zero-shot classification, we propose "OpenCBM" to equip the CBM with open vocabulary concepts via: (1) Aligning the feature space of a trainable image feature extractor with that of a CLIP's image encoder via a prototype based feature alignment; (2) Simultaneously training an image…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Topic Modeling
MethodsSparse Evolutionary Training · Contrastive Language-Image Pre-training
