Can Machine Learning Assist in Diagnosis of Primary Immune Thrombocytopenia? A feasibility study
Haroon Miah, Dimitrios Kollias, Giacinto Luca Pedone, Drew Provan,, Frederick Chen

TL;DR
This study explores the feasibility of using machine learning models to diagnose Primary Immune Thrombocytopenia (ITP) from routine blood tests and demographic data, highlighting the potential and limitations of ML in clinical diagnosis.
Contribution
It demonstrates that decision tree and random forest models can effectively diagnose ITP with high accuracy and fairness, emphasizing the importance of model choice and data features.
Findings
Decision Tree and Random Forest achieved high predictive accuracy.
Models without demographic data had better accuracy but lower fairness.
Platelet count was the most significant variable for diagnosis.
Abstract
Primary Immune thrombocytopenia (ITP) is a rare autoimmune disease characterised by immune-mediated destruction of peripheral blood platelets in patients leading to low platelet counts and bleeding. The diagnosis and effective management of ITP is challenging because there is no established test to confirm the disease and no biomarker with which one can predict the response to treatment and outcome. In this work we conduct a feasibility study to check if machine learning can be applied effectively for diagnosis of ITP using routine blood tests and demographic data in a non-acute outpatient setting. Various ML models, including Logistic Regression, Support Vector Machine, k-Nearest Neighbor, Decision Tree and Random Forest, were applied to data from the UK Adult ITP Registry and a general hematology clinic. Two different approaches were investigated: a demographic-unaware and a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPlatelet Disorders and Treatments · Blood properties and coagulation · Blood groups and transfusion
MethodsLogistic Regression
