Survival-supervised latent Dirichlet allocation models for genomic analysis of time-to-event outcomes
John A. Dawson, Christina Kendziorski

TL;DR
This paper introduces survLDA, a novel survival-supervised latent Dirichlet allocation model that integrates high-dimensional genomic and clinical data to identify patient subgroups with distinct survival outcomes in cancer studies.
Contribution
The paper extends LDA to genomics and incorporates survival supervision, enabling the discovery of meaningful patient subgroups based on genomic and clinical features.
Findings
Identified patient subgroups with different survival rates in ovarian cancer
Characterized genomic features associated with survival outcomes
Demonstrated effectiveness of survLDA on TCGA data
Abstract
Two challenging problems in the clinical study of cancer are the characterization of cancer subtypes and the classification of individual patients according to those subtypes. Statistical approaches addressing these problems are hampered by population heterogeneity and challenges inherent in data integration across high-dimensional, diverse covariates. We have developed a survival-supervised latent Dirichlet allocation (survLDA) modeling framework to address these concerns. LDA models have proven extremely effective at identifying themes common across large collections of text, but applications to genomics have been limited. Our framework extends LDA to the genome by considering each patient as a `document' with `text' constructed from clinical and high-dimensional genomic measurements. We then further extend the framework to allow for supervision by a time-to-event response. The model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Bayesian Methods and Mixture Models · Cancer-related molecular mechanisms research
