Matryoshka Concept Bottleneck Models
Ziye Chen, Hongbin Lin, Xinyue Xu, Jie Li, Lijie Hu

TL;DR
This paper introduces the Matryoshka Concept Bottleneck Model (MCBM), a hierarchical approach that reduces intervention costs and improves interpretability in concept-based deep learning models.
Contribution
The paper proposes MCBM, a unified hierarchical architecture that enables adaptive concept utilization and reduces intervention costs from linear to logarithmic scale.
Findings
MCBM matches the performance of separate models.
Reduces expected intervention costs to O(log K).
Enables dynamic and efficient expert interaction.
Abstract
Concept Bottleneck Models (CBMs) have emerged as a prominent paradigm for interpretable deep learning, learning by grounding predictions in human-understandable concepts. However, their practical deployment is hindered by the high cost of test-time intervention, as correcting model errors typically requires human experts to manually inspect and verify a large set of predicted concepts. Existing approaches suffer from a fundamental structural limitation: they either adopt a single static concept set, forcing experts to exhaustively annotate concepts and incurring prohibitive intervention costs, or train multiple models tailored to different concept budgets, resulting in substantial computational and maintenance overhead. To address this challenge, we propose the Matryoshka Concept Bottleneck Model (MCBM), a unified architecture that enables adaptive concept utilization within a single…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
