CytoSAE: Interpretable Cell Embeddings for Hematology
Muhammed Furkan Dasdelen, Hyesu Lim, Michele Buck, Katharina S. G\"otze, Carsten Marr, Steffen Schneider

TL;DR
CytoSAE introduces an interpretable sparse autoencoder trained on blood cell images that generalizes well, identifies relevant cellular concepts, and aids in disease classification with explainability at the sub-cellular level.
Contribution
This work presents CytoSAE, a novel sparse autoencoder for hematology that provides interpretable, patient-specific cellular concepts and outperforms existing methods in disease detection.
Findings
Generalizes to diverse hematology datasets
Identifies morphologically relevant cellular concepts validated by experts
Achieves state-of-the-art performance in AML subtype classification
Abstract
Sparse autoencoders (SAEs) emerged as a promising tool for mechanistic interpretability of transformer-based foundation models. Very recently, SAEs were also adopted for the visual domain, enabling the discovery of visual concepts and their patch-wise attribution to tokens in the transformer model. While a growing number of foundation models emerged for medical imaging, tools for explaining their inferences are still lacking. In this work, we show the applicability of SAEs for hematology. We propose CytoSAE, a sparse autoencoder which is trained on over 40,000 peripheral blood single-cell images. CytoSAE generalizes to diverse and out-of-domain datasets, including bone marrow cytology, where it identifies morphologically relevant concepts which we validated with medical experts. Furthermore, we demonstrate scenarios in which CytoSAE can generate patient-specific and disease-specific…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Imaging for Blood Diseases · Single-cell and spatial transcriptomics
