Hierarchical, Interpretable, Label-Free Concept Bottleneck Model
Haodong Xie, Yujun Cai, Rahul Singh Maharjan, Yiwei Wang, Federico Tavella, and Angelo Cangelosi

TL;DR
HIL-CBM is a hierarchical, interpretable, label-free concept bottleneck model that improves interpretability and accuracy by mirroring human cognitive abstraction levels without needing relational annotations.
Contribution
It introduces a hierarchical framework for CBMs with a gradient-based loss and dual classification heads, enhancing interpretability and performance without relational concept annotations.
Findings
HIL-CBM outperforms state-of-the-art sparse CBMs in accuracy.
Human evaluations show HIL-CBM provides more interpretable explanations.
HIL-CBM maintains a hierarchical, label-free approach to feature concepts.
Abstract
Concept Bottleneck Models (CBMs) introduce interpretability to black-box deep learning models by predicting labels through human-understandable concepts. However, unlike humans, who identify objects at different levels of abstraction using both general and specific features, existing CBMs operate at a single semantic level in both concept and label space. We propose HIL-CBM, a Hierarchical Interpretable Label-Free Concept Bottleneck Model that extends CBMs into a hierarchical framework to enhance interpretability by more closely mirroring the human cognitive process. HIL-CBM enables classification and explanation across multiple semantic levels without requiring relational concept annotations. HIL-CBM aligns the abstraction level of concept-based explanations with that of model predictions, progressing from abstract to concrete. This is achieved by (i) introducing a gradient-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
