DinoBloom: A Foundation Model for Generalizable Cell Embeddings in Hematology
Valentin Koch, Sophia J. Wagner, Salome Kazeminia, Ece Sancar,, Matthias Hehr, Julia Schnabel, Tingying Peng, Carsten Marr

TL;DR
DinoBloom introduces a large-scale foundation model for hematology cell images, demonstrating superior generalization and classification performance across diverse datasets and tasks, thus advancing computational diagnostics in hematology.
Contribution
This work presents the first foundation model for hematology cell images, leveraging extensive datasets and a tailored DINOv2 pipeline to improve generalization and downstream application performance.
Findings
Outperforms existing models in cell-type classification and AML subtyping
Demonstrates strong generalization on external datasets with domain shifts
Provides adaptable models for various hematology imaging tasks
Abstract
In hematology, computational models offer significant potential to improve diagnostic accuracy, streamline workflows, and reduce the tedious work of analyzing single cells in peripheral blood or bone marrow smears. However, clinical adoption of computational models has been hampered by the lack of generalization due to large batch effects, small dataset sizes, and poor performance in transfer learning from natural images. To address these challenges, we introduce DinoBloom, the first foundation model for single cell images in hematology, utilizing a tailored DINOv2 pipeline. Our model is built upon an extensive collection of 13 diverse, publicly available datasets of peripheral blood and bone marrow smears, the most substantial open-source cohort in hematology so far, comprising over 380,000 white blood cell images. To assess its generalization capability, we evaluate it on an external…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗1aurent/vit_small_patch14_224.dinobloommodel· 491 dl· ♡ 1491 dl♡ 1
- 🤗1aurent/vit_base_patch14_224.dinobloommodel· 143 dl· ♡ 1143 dl♡ 1
- 🤗1aurent/vit_large_patch14_224.dinobloommodel· 33 dl33 dl
- 🤗1aurent/vit_giant_patch14_224.dinobloommodel· 2.4k dl2.4k dl
- 🤗virtual-human-chc/DinoBloommodel· 4 dl· ♡ 34 dl♡ 3
- 🤗MarrLab/DinoBloommodel· 17 dl· ♡ 317 dl♡ 3
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSingle-cell and spatial transcriptomics · Mathematical Biology Tumor Growth · Digital Imaging for Blood Diseases
