Visual Subpopulation Discovery and Validation in Cohort Study Data
Shiva Alemzadeh, Tommy Hielscher, Uli Niemann, Lena Cibulski, Till, Ittermann, Henry V\"olzke, Myra Spiliopoulou, Bernhard Preim

TL;DR
This paper introduces S-ADVIsED, an interactive visual analytics framework that helps epidemiologists discover, explore, and validate meaningful subpopulations in large, heterogeneous cohort study data using subspace clustering and visualization techniques.
Contribution
The paper presents a novel visual analytics tool that integrates subspace clustering with validation features, enabling epidemiologists to explore and verify subpopulation findings effectively.
Findings
Identified a subpopulation with high liver fat that deviates in multiple health variables.
Demonstrated the tool's capability to validate subpopulations across different cohorts.
Provided insights into health risk factors through visual exploration.
Abstract
Epidemiology aims at identifying subpopulations of cohort participants that share common characteristics (e.g. alcohol consumption) to explain risk factors of diseases in cohort study data. These data contain information about the participants' health status gathered from questionnaires, medical examinations, and image acquisition. Due to the growing volume and heterogeneity of epidemiological data, the discovery of meaningful subpopulations is challenging. Subspace clustering can be leveraged to find subpopulations in large and heterogeneous cohort study datasets. In our collaboration with epidemiologists, we realized their need for a tool to validate discovered subpopulations. For this purpose, identified subpopulations should be searched for independent cohorts to check whether the findings apply there as well. In this paper we describe our interactive Visual Analytics framework…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Visualization and Analytics · Data-Driven Disease Surveillance
