SDGCCA: Supervised Deep Generalized Canonical Correlation Analysis for Multi-omics Integration
Jeongyoung Hwang (1), Sehwan Moon (2), Hyunju Lee (1, 2) ((1), Artificial Intelligence Graduate School of Gwangju Institute of Science and, Technology, (2) School of Electrical Engineering, Computer Science of, Gwangju Institute of Science, Technology)

TL;DR
SDGCCA is a novel deep learning method that integrates multi-omics data to improve phenotype classification and biomarker discovery by modeling complex nonlinear correlations across multiple data views.
Contribution
It introduces a supervised deep generalized canonical correlation analysis method that considers multiple data views for nonlinear correlation modeling and phenotype discrimination.
Findings
Outperformed existing CCA-based and supervised methods in AD and cancer prediction.
Effectively identified biologically relevant multi-omics biomarkers.
Demonstrated capability for feature selection and gene clustering related to diseases.
Abstract
Integration of multi-omics data provides opportunities for revealing biological mechanisms related to certain phenotypes. We propose a novel method of multi-omics integration called supervised deep generalized canonical correlation analysis (SDGCCA) for modeling correlation structures between nonlinear multi-omics manifolds, aiming for improving classification of phenotypes and revealing biomarkers related to phenotypes. SDGCCA addresses the limitations of other canonical correlation analysis (CCA)-based models (e.g., deep CCA, deep generalized CCA) by considering complex/nonlinear cross-data correlations and discriminating phenotype groups. Although there are a few methods for nonlinear CCA projections for discriminant purposes of phenotypes, they only consider two views. On the other hand, SDGCCA is the nonlinear multiview CCA projection method for discrimination. When we applied…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBioinformatics and Genomic Networks · Gene expression and cancer classification
MethodsFeature Selection
