Enabling scalable clinical interpretation of ML-based phenotypes using real world data
Owen Parsons (1), Nathan E Barlow (1), Janie Baxter (1), Karen, Paraschin (2), Andrea Derix (2), Peter Hein (2), Robert D\"urichen (1) ((1), Sensyne Health, Oxford, UK, (2) Research, Development, Pharmaceuticals,, Bayer AG, Wuppertal, Germany)

TL;DR
This paper presents tools and methods to improve the scalability, interpretability, and clinical relevance of ML-based patient stratification using large EHR datasets, enabling more meaningful insights in healthcare research.
Contribution
It introduces novel tools for clinical evaluation and interpretation of unsupervised patient stratification, reducing complexity and analysis time in large EHR datasets.
Findings
Meta clustering reduced 72 patient clusters to 3
Surrogate models identified key features like blood sodium in heart failure stratification
Curation improved clinical relevance of stratification results
Abstract
The availability of large and deep electronic healthcare records (EHR) datasets has the potential to enable a better understanding of real-world patient journeys, and to identify novel subgroups of patients. ML-based aggregation of EHR data is mostly tool-driven, i.e., building on available or newly developed methods. However, these methods, their input requirements, and, importantly, resulting output are frequently difficult to interpret, especially without in-depth data science or statistical training. This endangers the final step of analysis where an actionable and clinically meaningful interpretation is needed.This study investigates approaches to perform patient stratification analysis at scale using large EHR datasets and multiple clustering methods for clinical research. We have developed several tools to facilitate the clinical evaluation and interpretation of unsupervised…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Chronic Disease Management Strategies · Artificial Intelligence in Healthcare
