Matryoshka Concept Bottleneck Models

Ziye Chen; Hongbin Lin; Xinyue Xu; Jie Li; Lijie Hu

arXiv:2605.20612 · cs.LG · May 21, 2026

TL;DR

This paper introduces the Matryoshka Concept Bottleneck Model (MCBM), a hierarchical approach that reduces intervention costs and improves interpretability in concept-based deep learning models.

Contribution

The paper proposes MCBM, a unified hierarchical architecture that enables adaptive concept utilization and reduces intervention costs from linear to logarithmic scale.

Findings

01  MCBM matches the performance of separate models trained for individual concept budgets.

02  MCBM reduces the expected intervention cost from linear to O(log K).

03  MCBM enables dynamic and efficient expert interaction.
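
To give a feel for the gap between linear and logarithmic scale mentioned above, here is a minimal sketch in Python. It is not the paper's method: the summary does not describe how MCBM builds its hierarchy, so the doubling prefix sizes, the linear scoring rule, and the names `nested_prefix_sizes` and `predict_with_prefix` are all assumptions made for illustration. The sketch only shows that if nested concept subsets double in size, a set of K concepts is covered by roughly log2(K) nested levels, and that one shared set of weights can be evaluated on any prefix.

```python
def nested_prefix_sizes(num_concepts):
    """Return doubling prefix sizes 1, 2, 4, ..., capped at num_concepts.

    Hypothetical helper: the doubling schedule is an assumption,
    not something stated in the paper summary.
    """
    sizes = []
    k = 1
    while k < num_concepts:
        sizes.append(k)
        k *= 2
    sizes.append(num_concepts)
    return sizes


def predict_with_prefix(concepts, weights, k):
    """Score a label using only the first k concepts of one shared model.

    Hypothetical linear head: a real concept bottleneck model would
    typically use a learned predictor on top of the concept layer.
    """
    return sum(c * w for c, w in zip(concepts[:k], weights[:k]))


if __name__ == "__main__":
    # Number of nested levels grows logarithmically with the concept count.
    for num_concepts in (8, 100, 1024):
        sizes = nested_prefix_sizes(num_concepts)
        print(num_concepts, "concepts ->", len(sizes), "nested levels:", sizes)

    # The same weights serve every budget; only the prefix length changes.
    concepts = [1, 0, 1, 1]
    weights = [0.5, 1.0, -0.25, 2.0]
    for k in nested_prefix_sizes(len(concepts)):
        print("budget", k, "score", predict_with_prefix(concepts, weights, k))
```

Under this assumed schedule, an expert choosing among nested budgets faces a handful of levels instead of every individual concept, which is one way a logarithmic count could arise. How MCBM actually reaches its O(log K) expected cost is described in the paper itself.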

Abstract

Concept Bottleneck Models (CBMs) have emerged as a prominent paradigm for interpretable deep learning, grounding predictions in human-understandable concepts. However, their practical deployment is hindered by the high cost of test-time intervention, as correcting model errors typically requires human experts to manually inspect and verify a large set of predicted concepts. Existing approaches suffer from a fundamental structural limitation: they either adopt a single static concept set, forcing experts to exhaustively annotate concepts and incurring prohibitive intervention costs, or train multiple models tailored to different concept budgets, resulting in substantial computational and maintenance overhead. To address this challenge, we propose the Matryoshka Concept Bottleneck Model (MCBM), a unified architecture that enables adaptive concept utilization within a single…
