Zero-shot Concept Bottleneck Models

Shin'ya Yamaguchi; Kosuke Nishida; Daiki Chijiwa; Yasutoshi Ida

arXiv:2502.09018·cs.LG·April 6, 2026

Zero-shot Concept Bottleneck Models

Shin'ya Yamaguchi, Kosuke Nishida, Daiki Chijiwa, Yasutoshi Ida

PDF

1 Repo

TL;DR

Zero-shot concept bottleneck models (Z-CBMs) enable interpretable predictions without training by leveraging a large concept bank and cross-modal retrieval, reducing resource needs.

Contribution

The paper introduces Z-CBMs, a novel approach that predicts concepts and labels in a zero-shot manner using a large concept bank and dynamic retrieval.

Findings

01

Z-CBMs achieve interpretable concept predictions without additional training.

02

The model effectively retrieves relevant concepts via cross-modal search.

03

Sparse linear regression selects essential concepts for label inference.

Abstract

Concept bottleneck models (CBMs) are inherently interpretable and intervenable neural network models, which explain their final label prediction by the intermediate prediction of high-level semantic concepts. However, they require target task training to learn input-to-concept and concept-to-label mappings, incurring target dataset collections and training resources. In this paper, we present zero-shot concept bottleneck models (Z-CBMs), which predict concepts and labels in a fully zero-shot manner without training neural networks. Z-CBMs utilize a large-scale concept bank, which is composed of millions of vocabulary extracted from the web, to describe arbitrary input in various domains. For the input-to-concept mapping, we introduce concept retrieval, which dynamically finds input-related concepts by the cross-modal search on the concept bank. In the concept-to-label inference, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yshinya6/zcbm
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.