A Large-scale Multi Domain Leukemia Dataset for the White Blood Cells Detection with Morphological Attributes for Explainability
Abdul Rehman, Talha Meraj, Aiman Mahmood Minhas, Ayisha Imran, Mohsen, Ali, Waqas Sultani

TL;DR
This paper introduces a large, multi-domain leukemia dataset with detailed morphological annotations of white blood cells, captured using different microscopes and magnifications, to aid deep learning diagnosis tools.
Contribution
It provides a comprehensive, annotated leukemia dataset across multiple domains and magnifications, along with baseline models and domain adaptation strategies for improved diagnosis.
Findings
Collected 55k morphological labels from 2.4k images.
Annotated 10.3k WBC types and artifacts.
Established baseline detection and domain adaptation methods.
Abstract
Earlier diagnosis of Leukemia can save thousands of lives annually. The prognosis of leukemia is challenging without the morphological information of White Blood Cells (WBC) and relies on the accessibility of expensive microscopes and the availability of hematologists to analyze Peripheral Blood Samples (PBS). Deep Learning based methods can be employed to assist hematologists. However, these algorithms require a large amount of labeled data, which is not readily available. To overcome this limitation, we have acquired a realistic, generalized, and large dataset. To collect this comprehensive dataset for real-world applications, two microscopes from two different cost spectrums (high-cost HCM and low-cost LCM) are used for dataset capturing at three magnifications (100x, 40x, 10x) through different sensors (high-end camera for HCM, middle-level camera for LCM and mobile-phone camera for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Imaging for Blood Diseases · Machine Learning in Bioinformatics
