# Interpretable Multiple Instance Learning for Hematologic Diagnosis from Peripheral Blood Smears

**Authors:** Siddharth Singi, Shenghuan Sun, Zhanghan Yin, Riya Gupta, Dylan C. Webb, Khawaja H. Bilal, Deepika Dilip, Linlin Wang, Neeraj Kumar, Swaraj Nanda, Nicolas Sanchez, Jacob G. Van Cleave, Brenda Fried, Sean Paulsen, Ethan S. Yan, Ali Kamali, Argho Sarkar, Allyne Manzo, Jeeyeon Baik, Irem S Isgor, Cesar Colorado-Jimenez, Anthony Cardillo, Leonardo Boiocchi, Aijazuddin Syed, David Kim, Brie Kezlarian-Sachs, Maly Fenelus, Alexander Chan, Mariko Yabe, Samuel I McCash, Menglei Zhu, Simon Mantha, Orly Ardon, Lauren McVoy, Wenbin Xiao, Mikhail Roshal, Oscar Lin, Ahmet Dogan, Iain Carmichael, Chad Vanderbilt, Gregory M. Goldgof

PMC · DOI: 10.21203/rs.3.rs-6933141/v1 · Research Square · 2025-10-31

## TL;DR

This paper introduces CAREMIL, an interpretable machine learning framework that improves hematologic diagnosis from blood smears by combining cell-level features with whole-slide predictions.

## Contribution

CAREMIL is a novel weakly supervised attention-based MIL framework that outperforms existing methods and provides interpretable insights for hematologic diagnosis.

## Key findings

- CAREMIL paired with DeepHeme achieves state-of-the-art performance in diagnosing AML, MDS, and HCL with high AUROC scores.
- CAREMIL improves performance especially when using out-of-domain encoders like those trained on ImageNet or open-source pathology models.
- Attention values from CAREMIL highlight diagnostically relevant cells and morphometric signatures, enabling biological interpretability.

## Abstract

Accurate diagnosis of hematologic malignancies from peripheral blood smears (PBSs) requires integrating cellular morphology and composition across hundreds of white blood cells. Existing approaches primarily automate single-cell classification and do not provide whole-slide diagnostic predictions. We present a full network that utilizes a highly performative cell-based encoder (DeepHeme) for feature extraction paired with our weakly supervised framework using attention-based multiple instance learning (MIL) that we call CAREMIL (Cell AggRegation, Explainable, Multiple Instance Learning). Upon evaluating various popular image encoders and MIL architectures, the combination of DeepHeme and CAREMIL is the best performing pipeline on our disease classification task. CAREMIL proves to be a robust aggregation function that outperforms the most commonly used slide level aggregation function (gated multiple instance learning) across several encoder types. The greatest improvements in performance gain with CAREMIL is observed when using out-of-domain encoders, including an encoder trained on ImageNet and leading open-source pathology foundational models (UNI2 and Virchow2). CAREMIL plus DeepHeme achieves the highest diagnostic performance across acute leukemia (AML), myelodysplastic syndromes (MDS), and hairy cell leukemia (HCL) (AUROCs 0.999, 0.891, and 0.945, respectively), and identifies AML disease even in cases with minimal or absent circulating blasts. Attention values assigned by CAREMIL highlight diagnostically relevant cells and reveal disease-specific morphometric signatures, enabling biological interpretability and case-level insight. CAREMIL remains robust to misclassified cell types by the cell image encoder and does not require explicit cell-level supervision. These findings position CAREMIL as an effective and interpretable multiple instance learning framework for hematologic slide diagnosis, with potential to extend to bone marrow aspirates, cytology, and other liquid biopsy specimens, and to support a broader shift toward quantitative, morphology-informed diagnostics in hematology.

## Full-text entities

- **Diseases:** HCL (MESH:D007943), AML (MESH:D015470), hematologic malignancies (MESH:D019337), MDS (MESH:D009190)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12636719/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12636719/full.md

## References

36 references — full list in the complete paper: https://tomesphere.com/paper/PMC12636719/full.md

---
Source: https://tomesphere.com/paper/PMC12636719