ParseCaps: An Interpretable Parsing Capsule Network for Medical Image Diagnosis
Xinyu Geng, Jiaming Wang, Jun Xu

TL;DR
ParseCaps introduces an interpretable capsule network with a hierarchical parse-tree structure and sparse axial attention routing, significantly improving medical image classification accuracy and interpretability for clinical use.
Contribution
The paper proposes ParseCaps, a novel capsule network architecture that enhances depth and interpretability using sparse axial attention routing and parse convolutional layers.
Findings
Outperforms other capsule networks in accuracy and robustness
Provides interpretable explanations without requiring concept labels
Reduces redundancy and improves classification on medical datasets
Abstract
Deep learning has excelled in medical image classification, but its clinical application is limited by poor interpretability. Capsule networks, known for encoding hierarchical relationships and spatial features, show potential in addressing this issue. Nevertheless, traditional capsule networks often underperform due to their shallow structures, and deeper variants lack hierarchical architectures, thereby compromising interpretability. This paper introduces a novel capsule network, ParseCaps, which utilizes the sparse axial attention routing and parse convolutional capsule layer to form a parse-tree-like structure, enhancing both depth and interpretability. Firstly, sparse axial attention routing optimizes connections between child and parent capsules, as well as emphasizes the weight distribution across instantiation parameters of parent capsules. Secondly, the parse convolutional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAI in cancer detection · Brain Tumor Detection and Classification
MethodsSoftmax · Attention Is All You Need · Capsule Network · Axial Attention
