Trajectories, bifurcations and pseudotime in large clinical datasets: applications to myocardial infarction and diabetes data
Sergey E. Golovenkin, Jonathan Bac, Alexander Chervov, Evgeny M., Mirkes, Yuliya V. Orlova, Emmanuel Barillot, Alexander N. Gorban, and Andrei, Zinovyev

TL;DR
This paper introduces a semi-supervised method using elastic principal graphs to analyze large clinical datasets, enabling the extraction of disease trajectories, pseudotime estimation, and prognosis characterization from synchronic observational data.
Contribution
The paper presents a novel approach for modeling clinical trajectories with elastic principal graphs, addressing challenges of mixed data types and missing values in large datasets.
Findings
Successfully applied to myocardial infarction data, revealing disease progression routes.
Effectively characterized diabetic patient readmission trajectories.
Provided a Python tool, ClinTrajan, for clinical trajectory analysis.
Abstract
Large observational clinical datasets become increasingly available for mining associations between various disease traits and administered therapy. These datasets can be considered as representations of the landscape of all possible disease conditions, in which a concrete pathology develops through a number of stereotypical routes, characterized by `points of no return' and `final states' (such as lethal or recovery states). Extracting this information directly from the data remains challenging, especially in the case of synchronic (with a short-term follow up) observations. Here we suggest a semi-supervised methodology for the analysis of large clinical datasets, characterized by mixed data types and missing values, through modeling the geometrical data structure as a bouquet of bifurcating clinical trajectories. The methodology is based on application of elastic principal graphs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsFeature Selection
