SHAP-CAT: A interpretable multi-modal framework enhancing WSI classification via virtual staining and shapley-value-based multimodal fusion
Jun Wang, Yu Mao, Nan Guan, Chun Jason Xue

TL;DR
SHAP-CAT is an interpretable multimodal framework that improves whole slide image classification by using virtual staining and Shapley-value-based fusion to select important features, leading to significant accuracy gains.
Contribution
The paper introduces SHAP-CAT, a novel interpretable multimodal framework that employs Shapley-value-based dimension reduction and virtual staining for enhanced histopathology classification.
Findings
Achieved up to 11% accuracy improvement on histopathology datasets.
Demonstrated effective feature importance attribution via Shapley values.
Enhanced model interpretability with multimodal fusion and synthetic data.
Abstract
The multimodal model has demonstrated promise in histopathology. However, most multimodal models are based on H\&E and genomics, adopting increasingly complex yet black-box designs. In our paper, we propose a novel interpretable multimodal framework named SHAP-CAT, which uses a Shapley-value-based dimension reduction technique for effective multimodal fusion. Starting with two paired modalities -- H\&E and IHC images, we employ virtual staining techniques to enhance limited input data by generating a new clinical-related modality. Lightweight bag-level representations are extracted from image modalities and a Shapley-value-based mechanism is used for dimension reduction. For each dimension of the bag-level representation, attribution values are calculated to indicate how changes in the specific dimensions of the input affect the model output. In this way, we select a few top important…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Handwritten Text Recognition Techniques · Image Processing and 3D Reconstruction
