Interpretability Beyond Classification Output: Semantic Bottleneck Networks
Max Losch, Mario Fritz, Bernt Schiele

TL;DR
Semantic Bottleneck Networks (SBN) introduce interpretable intermediate layers in deep models, enabling transparent analysis of predictions, failure modes, and confidence, demonstrated on street scene segmentation with high accuracy.
Contribution
This paper proposes Semantic Bottleneck Networks with interpretable layers, maintaining state-of-the-art performance while enhancing interpretability and failure analysis capabilities.
Findings
Achieved over 99% accuracy with interpretable segmentation results.
Reduced feature dimensionality from thousands to tens without performance loss.
Enabled analysis of failure cases and confidence prediction using SB-Layer activations.
Abstract
Today's deep learning systems deliver high performance based on end-to-end training. While they deliver strong performance, these systems are hard to interpret. To address this issue, we propose Semantic Bottleneck Networks (SBN): deep networks with semantically interpretable intermediate layers that all downstream results are based on. As a consequence, the analysis on what the final prediction is based on is transparent to the engineer and failure cases and modes can be analyzed and avoided by high-level reasoning. We present a case study on street scene segmentation to demonstrate the feasibility and power of SBN. In particular, we start from a well performing classic deep network which we adapt to house a SB-Layer containing task related semantic concepts (such as object-parts and materials). Importantly, we can recover state of the art performance despite a drastic dimensionality…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Advanced Neural Network Applications
