Deep Learning-Enabled Multi-Omics Integration: A New Frontier in Precise Drug Target Discovery
Yufei Ren, Haotian Bai, Jihan Wang, Yanning Yang, Yangyang Wang

TL;DR
This review explores how deep learning helps combine different biological data to discover new drug targets, improving precision medicine and reducing pharmaceutical costs.
Contribution
The paper introduces a systematic review of deep learning methods for multi-omics integration in drug target discovery.
Findings
DL-driven multi-omics integration can identify novel disease drivers and therapeutic targets.
Challenges include data sparsity, model interpretability, and target validation.
Emerging AI techniques like Generative AI and XAI offer opportunities to overcome these challenges.
Abstract
The identification of novel drug targets is essential for developing effective therapies and reducing the substantial costs and high failure rates in pharmaceutical research. Traditional analytical methods focusing on a single biological layer often fail to capture the systemic complexity of human diseases. This review summarizes advancements in computational frameworks that integrate diverse biological data, including genes, proteins, and metabolites, to facilitate drug target discovery. We examine how these methodologies identify disease drivers, predict genetic interactions, and prioritize potential therapeutic candidates. Furthermore, this work evaluates critical challenges such as data sparsity, the limited interpretability of models, and the necessity of assessing the chemical and clinical feasibility of predicted targets. By addressing data inconsistencies and establishing…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4- —Shaanxi Provincial Key Laboratory of Infection and Immune Diseases
- —Yan’an University
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBioinformatics and Genomic Networks · Computational Drug Discovery Methods · Genetic Associations and Epidemiology
1. Introduction
The transition toward precision medicine represents a fundamental shift in drug discovery, moving beyond symptom-based treatments to target the molecular drivers of disease [1,2,3]. Despite these advancements, the pharmaceutical industry continues to face the challenges of “Eroom’s Law,” where escalating research costs are met with stagnant approval rates [4,5,6]. This trend is largely driven by high attrition in late-stage clinical trials, often resulting from the selection of sub-optimal or non-druggable targets [7,8,9]. Consequently, the identification of high-confidence therapeutic targets has become the pivotal bottleneck in improving clinical success rates. Traditionally, target discovery has relied on reductionist single-omics methods, such as Genome-Wide Association Studies (GWASs) or differential expression analysis [10,11]. However, these approaches frequently fail to capture the systemic complexity of human diseases, overlooking integrated regulatory mechanisms involving epigenetic modifications, post-translational interactions, and metabolic perturbations [12].
To resolve these limitations, the integration of multi-omics data, including genomics, transcriptomics, proteomics, and metabolomics, has become essential for reconstructing the functional molecular networks underlying pathogenesis [13,14,15]. Unlike isolated datasets, multi-omics integration provides complementary biological insights, enabling researchers to reconstruct complex molecular networks underlying disease pathogenesis [16,17,18]. Consequently, multi-omics integration facilitates the identification of therapeutic targets that are biologically actionable and clinically relevant [19,20,21]. However, transforming these high-dimensional data into actionable medical knowledge presents formidable computational challenges. Multi-omics datasets are characterized by high heterogeneity, sparsity, scale differences, and the “curse of dimensionality,” rendering traditional bioinformatics and statistical methods insufficient for effective integration [9,22,23].
DL represents a transformative breakthrough in handling this biological complexity, offering capabilities far beyond traditional methods [24,25,26]. While classical algorithms require labor-intensive feature engineering, DL models excel at extracting high-level abstractions from raw data through automated representation learning. Specific architectures have been tailored to overcome the hurdles of multi-omics analysis: variational autoencoders (VAEs) serve to denoise and compress sparse omics data into robust low-dimensional representations [27], convolutional neural networks (CNNs) extract local patterns from sequences or spatial data [28], and graph neural networks (GNNs) incorporate prior knowledge to preserve the structural topology of biological interactions [29]. Furthermore, the transformer architecture employs attention mechanisms to differentially weight omics modalities and capture long-range dependencies within biological sequences [30], as well as diffusion models for generative data augmentation. Recent advancements also include large-scale pre-trained foundation models, which enable accurate predictions in data-scarce scenarios [31,32], effectively overcoming the sample size bottleneck. By addressing the intrinsic complexity of multi-omics data, these DL-driven frameworks transcend traditional limitations to support a holistic characterization of disease signatures. This capability empowers researchers to capture intricate non-linear dependencies and pinpoint high-confidence therapeutic vulnerabilities, thereby catalyzing a shift toward data-driven precision medicine. Figure 1 provides a schematic overview of this DL workflow, illustrating the systematic progression from multi-omics data acquisition through the DL pipeline to downstream therapeutic applications.
This review aims to comprehensively and systematically summarize the application of DL-driven multi-omics integration in drug target discovery. First, we outline the multi-omics data foundation underpinning target discovery, highlighting the biological significance of different omics layers and the key public databases that support data-driven target identification. We then provide a structured overview of multi-omics integration strategies and dimensionality reduction technologies, and we systematically examine the major DL architectures for multi-omics integration tasks. Subsequently, we discuss the applications of DL-enabled multi-omics integration in drug target discovery tasks, including disease driver identification, synthetic lethality prediction, and target prioritization. Finally, we critically examine the current limitations of DL-enabled multi-omics approaches, with a particular focus on data quality, model interpretability, and experimental validation, and discuss emerging opportunities driven by explainable artificial intelligence, generative artificial intelligence, large multimodal models, and multidimensional feasibility assessment frameworks that may shape the future of precision medicine.
2. The Multi-Omics Data Foundation for Drug Target Discovery
2.1. Types of Multi-Omics and Their Biological Significance
Genomics and epigenomics form the foundational layers of biological regulation. Genomics analyzes static DNA-level information, including single-nucleotide variations (SNVs), copy number variations (CNVs), and structural rearrangements, to gain an in-depth understanding of genetic or somatic mutations that may produce actionable dependencies [33,34,35]. While genomic changes are primary indicators of abnormal signaling pathways [36,37], they are intrinsically regulated by the epigenome. Epigenomics captures DNA methylation patterns, histone modifications, chromatin accessibility, and three-dimensional genome structure [38,39,40]. Unlike genetic mutations, the epigenetic state is reversible, making it a highly promising drug target. Epigenome analysis provides mechanistic insights into transcriptional dysregulation and can reveal epigenetic regulation nodes as potential therapeutic intervention points [41,42,43].
Downstream of these regulatory layers, the functional state of the cell is captured by transcriptomics, proteomics, and metabolomics. Transcriptomics characterizes the regulation of gene expression dynamics and RNA levels [9,44]. By quantifying mRNA abundance, alternative splicing, and non-coding RNA activity, it reflects the functional consequences of genomic perturbations and reveals dysregulated pathways or expression-based biomarkers [45,46,47]. However, gene expression does not always correlate with protein levels, necessitating proteomics to quantify protein abundance, post-translational modifications (PTMs), and protein–protein interactions (PPIs) [48,49,50]. Since most therapeutic drugs bind directly to proteins, proteomics accurately reflects drug-targeted biology, identifying pathway bottlenecks and active signaling modules [51,52,53]. Furthermore, metabolomics investigates small-molecule metabolites that reflect the final biochemical state of cells [54,55]. As metabolic reprogramming is a hallmark of many diseases, this layer can reveal regulatory mechanisms of rate-limiting enzymes and metabolic bottlenecks, providing viable drug targets. Complementing these bulk-level profiles, single-cell omics techniques, including scRNA-seq, scATAC-seq, and spatially resolved multi-omics, introduce cellular resolution into target discovery. These methods can identify cell type-specific targets, rare subpopulations, tumor microenvironment interactions, and heterogeneity-driven drug resistance mechanisms [56,57,58]. These omics layers collectively constitute a network of interconnected molecules. Accurate identification of reliable drug targets is made possible by the integration of these multi-omics data, which offers a systemic biological understanding surpassing that of any single approach. A comprehensive summary of these omics layers, detailing their key features and specific implications for drug target discovery, is provided in Table 1.
2.2. Key Databases for Drug Target Discovery
A robust data infrastructure is fundamental for integrating high-dimensional biomedical datasets. Numerous specialized repositories, encompassing genomic variants to protein–ligand affinities and supporting programmatic access via APIs, constitute an essential ecosystem for training AI models (Table 2). The Open Targets Platform serves as a gold standard, integrating multi-source evidence to provide quantified gene–disease association scores. These “ground truth” labels enable supervised deep learning models to characterize druggable targets beyond simple correlations. Additionally, STRING offers curated protein–protein interaction networks, providing the topological foundation for Graph Neural Networks to identify regulatory hubs via network propagation. Together with resources summarized in Table 2, such as GTEx for tissue specificity, DepMap for cancer dependencies, and DrugBank for pharmacological profiles, these databases underpin modern AI-driven target discovery.
3. Strategies and DL Architectures Enabling Multi-Omics-Driven Target Discovery
Multi-omics integration provides a holistic view of disease biology but introduces severe dimensionality and heterogeneity challenges. DL addresses these challenges by learning abstract representations from complex biological networks [78], thereby enabling more precise drug target discovery. This section reviews the integration strategies, dimensionality reduction technologies and the principal DL architectures that currently drive drug target discovery based on multi-omics.
3.1. Strategies for Multi-Omics Integration
The strategic selection between single-omics and multi-omics approaches is governed by a fundamental trade-off between statistical robustness and biological depth. Single-omics models remain appropriate for small-sample regimes to mitigate “the curse of dimensionality” and ensure model stability. However, multi-omics integration is necessitated when resolving the synergistic regulatory mechanisms of complex diseases is paramount. In the context of DL-driven multi-omics integration, DL frameworks typically employ three fusion paradigms (Figure 2). Early integration concatenates raw features into a single input vector but often struggles with heterogeneous data distributions and modality-specific noise [79]. Late integration aggregates predictions from independent models [80,81]; while preserving distinct patterns, it typically overlooks critical synergistic interactions. Intermediate integration has become the primary focus in current research [9]. By fusing latent representations from separate network branches, this strategy effectively characterizes non-linear cross-talk [82], facilitating the extraction of robust biological signals as evidenced by frameworks such as MOGONET and OmiEmbed [44,83].
3.2. Dimensionality Reduction and Manifold Learning
Given the high-dimensional and sparse nature of multi-omics and molecular dynamics data, dimensionality reduction serves as a critical bridge between raw biological information and interpretable feature learning. Beyond standard visualization techniques like Uniform Manifold Approximation and Projection (UMAP) that preserve local topology [84], deep learning has revolutionized the extraction of latent dynamics from molecular simulations. Notably, approaches such as molecular dynamics-based Markov State Models (MD-MSMs) and VAMPnets utilize time-lagged autoencoders to extract slow-process dynamics from high-dimensional trajectories [85,86]. To ensure embedding fidelity, metrics like embedding error analysis and Wasserstein Distance are increasingly used to quantify topological preservation [87]. These methods collectively distill complex, high-dimensional biological noise into robust low-dimensional manifolds, facilitating the precise identification of cryptic allosteric sites and dynamic therapeutic targets.
3.3. DL Architectures
3.3.1. Autoencoders (AEs) and Variational Autoencoders (VAEs)
Autoencoder (AE)-based frameworks serve as a foundational paradigm in multi-omics integration, effectively reconciling high dimensionality and inherent noise through unsupervised latent representation learning [88,89]. Specialized variants, such as Variational Autoencoders (VAEs), facilitate sophisticated non-linear feature extraction while leveraging generative capabilities to augment limited clinical datasets [90,91]. In multi-modal configurations, modality-specific encoders are typically mapped to a shared latent bottleneck, a mechanism illustrated by the VAE architecture in Figure 3a. This configuration enables cross-omics alignment and synergistic representation learning, thereby providing a robust foundation for downstream therapeutic target discovery. Recent advancements underscore the utility of VAEs and AEs in distilling actionable biological insights from multi-omics noise. A recent study proposed a novel method called FactVAE [92]. The model successfully inferred regulatory peaks for TESK2, a key kinase regulating the actin cytoskeleton. Furthermore, FactVAE identified regulatory elements associated with PHACTR4, a gene known to restrict cancer cell proliferation. Similarly, Pan et al. developed i-Modern [93], an interpretable deep learning framework designed to identify therapeutic targets in glioma. By employing an autoencoder for automated feature extraction, the model integrates heterogeneous multi-omics data, including transcriptomics, miRNA expression, somatic mutations, CNVs, DNA methylation, and proteomics. Notable targets identified included high gene expression of DLL3, specific somatic mutations in IDH1, and high protein expression of IGFBP2, all of which correlated with better prognosis. These findings align with established literature, demonstrating the framework’s efficacy in prioritizing biologically relevant targets.
3.3.2. Graph Neural Networks (GNNs)
Given the inherently relational nature of biological systems, Graph Neural Networks (GNNs) such as Graph Convolutional (GCN) and Graph Attention (GAT) architectures are essential for modeling complex interaction networks. As shown in Figure 3b, these models effectively capture non-Euclidean biological structures by transforming topological relationships into feature embeddings for downstream predictions, including drug target discovery. For instance, Wang et al. developed MOGONET [44], a supervised framework that utilizes GCNs for omics-specific learning and a View Correlation Discovery Network (VCDN) for high-level integration. The model identified critical multi-modal biomarkers for Alzheimer’s Disease (AD). In AD, selected features (e.g., APLN, KIF5A, hsa-miR-423) were biologically validated through GO enrichment analysis linking them to Tau phosphorylation, amyloid-β accumulation, and neuroinflammation. Similarly, Zhang et al. proposed MosGraphFlow [94], a novel integrative graph AI model designed to mine signaling targets from multi-omics data. MosGraphFlow captures multi-level molecular interactions to provide a more comprehensive understanding of AD pathogenesis. By integrating diverse omic layers into a cohesive flow-based graph architecture, the model effectively identifies not only potential biomarkers but also critical signaling pathways that could serve as therapeutic targets. In addition, Niu et al. introduced GLIMS [95], a two-stage gradual-learning framework for cancer gene prediction. The method first employs a semi-supervised hierarchical graph neural network to integrate multi-omics data with protein–protein interaction (PPI) networks for initial candidate identification. It then refines these predictions using an unsupervised approach that incorporates co-splicing networks, effectively capturing critical post-transcriptional regulatory mechanisms to outperform state-of-the-art methods. In a parallel study leveraging GCNs to decipher complex biological interactions, Dai et al. developed DriverOmicsNet [96], a Graph Convolutional Network (GCN) framework that integrates multi-omics data with protein–protein interaction (PPI) networks and weighted gene correlation network analysis (WGCNA). The model demonstrated robust predictive performance by identifying key hub genes, such as ANK2 in stomach adenocarcinoma (STAD) and ACTB in skin cutaneous melanoma (SKCM), which are associated with immune checkpoint response and immune cell infiltration, respectively. Moreover, Li et al. developed CGMega [97], an explainable deep learning framework based on graph attention networks. In acute myeloid leukemia (AML), the framework identified 396 candidate genes and revealed patient-specific gene modules, providing high-order mechanistic insights into cancer heterogeneity and development. Ultimately, by explicitly modeling biological interactions, GNNs offer a distinct capability to uncover network-based therapeutic vulnerabilities that conventional flat-data approaches might overlook.
3.3.3. Convolutional Neural Networks (CNNs)
Although traditionally established in image processing, Convolutional Neural Networks (CNNs) have been effectively adapted to extract features from omics data [98]. Figure 3c demonstrates the standard workflow of this architecture. 1D-CNNs are frequently applied to genomic sequence analysis, such as predicting transcription factor binding sites, or for extracting latent features from chemical structures represented as SMILES strings [99,100], whereas 2D-CNNs are essential for capturing spatial patterns in spatial omics or transforming heterogeneous tabular data into structured feature maps [101]. Demonstrating the utility of 1D-CNNs for quantitative omics, Zompola et al. developed Omics-CNN to analyze complex biological datasets [100]. Applied to the study of ischemic stroke, the pipeline revealed a biosignature associated with the sialic acid metabolism pathway, identifying it as a novel mechanism and a potential therapeutic target for atherosclerosis-related diseases. Crucially, the model prioritized KRT15, VPRBP, TNFRSF4, and GORASP2 as the most significant contributing transcripts. While confirming the known genetic association of TNFRSF4, the study specifically highlighted GORASP2 as a potential therapeutic target for ischemic injury within the Golgi apparatus, showcasing the capability of deep learning to uncover novel druggable candidates beyond established knowledge. Regarding 2D-CNNs, Alok Sharma et al. proposed the DeepInsight-3D architecture [102]. This method relies on a structured data-to-image conversion approach, thereby enabling the utilization of Convolutional Neural Networks (CNNs). The model effectively integrates different types of omics data and demonstrates high efficacy in discovering underlying significant genes. Furthermore, the integration of CNNs with prior biological knowledge has demonstrated substantial potential in neurodegenerative disease research. Wang et al. developed a deep joint learning diagnostic model for Alzheimer’s disease (AD) by introducing a novel multimodal fusion feature termed “MRI-p value” [103]. In this framework, 3D fusion images are constructed by incorporating genetic p-values as a priori knowledge into magnetic resonance imaging (MRI) data. The architecture utilizes a dual-branch CNN, in which one branch employs a Residual Network (ResNet) to extract local pathological features, while the other utilizes attention-based convolutions to capture discriminative spatial patterns across diverse brain regions. This model achieved high diagnostic accuracy across AD, mild cognitive impairment (MCI), and healthy controls. Crucially, it identified six novel genetic targets, such as NTM, MAML2, and PCSK5, thereby demonstrating the efficacy of CNNs in translating complex multimodal omics and imaging data into actionable therapeutic insights. In summary, whether applied to sequence data or spatial constructs, CNNs provide a powerful means to capture local dependencies and structural patterns critical for precise target localization.
3.3.4. Transformer and Attention Mechanism
The Transformer architecture has emerged as a premier framework for sequence modeling and multi-omics integration. As illustrated in Figure 4, this architecture leverages multi-head attention to orchestrate the fusion of disparate omic layers, capturing intricate non-linear dependencies while transcending the proximity constraints of traditional models [30]. Within biological contexts, Transformer-based frameworks such as DNABERT excel at deciphering the semantic “language” of genomic and proteomic sequences [104,105]. Moreover, a study employed a transformer-based framework to integrate multi-omics data for drug target discovery. For instance, DeePathNet is a transformer-based deep learning model that integrates multi-omics cancer data with biological pathway information to predict drug response and support pathway-level biomarker discovery [106]. By combining genomic, transcriptomic, and other omics inputs within a transformer architecture that encodes pathway interactions, DeePathNet outperforms traditional models in predicting drug responses and identifying biologically relevant features that may serve as candidate therapeutic targets, demonstrating the utility of transformer-driven multi-omics integration for drug target discovery. Researchers developed Precious1GPT [107], a multimodal transformer-based framework that leverages transfer learning to integrate transcriptomic and methylation data along with metadata. The model utilizes feature importance analysis to identify dual-purpose therapeutic targets potentially implicated in both aging processes and age-associated diseases, demonstrating the versatility of transformers in multi-omics representation learning. Taken together, as the dominant architecture in sequence modeling, transformer-based approaches are poised to revolutionize how we interpret the “language” of multi-omics, offering unprecedented interpretability for future drug target discovery pipelines.
3.3.5. Diffusion Models
Diffusion models serve as a powerful generative framework for multi-omics integration, enabling the joint modeling of heterogeneous modalities and the reconstruction of high-fidelity data from sparse, noisy biological landscapes. By capturing complex high-dimensional distributions, these architectures facilitate robust data imputation and conditional generation. For instance, Janson et al. developed idpGAN to directly generate realistic protein conformational ensembles [108], bypassing the need for computationally expensive physics-based iterative sampling. By learning from molecular mechanics simulations, this model produces physically plausible and energetically favorable conformations for unseen proteins, thereby facilitating granular assessments of target druggability. This rapid synthesis allows researchers to account for protein flexibility and dynamic binding sites, a capability essential for identifying non-obvious therapeutic targets that traditional static structural models often overlook. In another distinct application, Luo et al. introduced scDiffusion [109], a conditional generative framework leveraging Latent Diffusion Models (LDMs) to synthesize high-fidelity single-cell data. This model demonstrates exceptional stability in learning complex distributions, enabling the generation of realistic gene expression profiles even for rare cell types with limited training samples. Notably, by employing a unique gradient interpolation strategy, scDiffusion can simulate out-of-distribution data and intermediate cell states between known phenotypes. Extending this generative paradigm to the spatial dimension, diffusion architectures have also been adapted to address the critical challenge of data sparsity in spatially resolved omics. For instance, Li et al. developed stDiff [110], a conditional diffusion framework that leverages single-cell transcriptomics to impute missing signals in spatial transcriptomics (ST) data. Unlike methods relying on simple cell-to-cell similarity, stDiff models the intrinsic correlations of gene expression abundance through a generative denoising process. This approach successfully preserved cellular topological structures and accurately reconstructed complex spatial patterns across diverse datasets, thereby enabling the precise delineation of tissue boundaries and cell populations. These considerations are crucial for translating computational predictions into clinically actionable insights. These advancements collectively provide a robust computational foundation for predicting high-confidence drug targets.
3.3.6. Comparative Summary of DL Architectures for Multi-Omics-Based Target Discovery
The selection of an optimal deep learning framework is not a one-size-fits-all endeavor but rather a strategic decision that depends on the specific biological hypothesis and data modality under investigation. Each architecture discussed above presents distinct trade-offs in terms of computational efficiency, feature extraction capability, and structural compatibility. To provide a consolidated perspective on model selection, Table 3 presents a comparative analysis of these principal deep learning architectures. This summary outlines their core strengths, inherent limitations, and context-specific data applicability, with particular emphasis on two essential dimensions: scalability to large-scale datasets and the potential for mechanistic interpretability. These considerations are crucial for translating computational predictions into clinically actionable insights.
4. Deep Learning-Enabled Multi-Omics Integration for Drug Target Discovery
The integration of multi-omics via DL has been successfully applied across multiple stages of the modern target discovery pipeline, demonstrating significant practical application value. This section focuses on its specific utility in identification of novel disease drivers, synthetic lethality prediction, target prioritization.
4.1. Identification of Novel Disease Drivers
DL has revolutionized driver identification by prioritizing network-based representation learning over frequency statistics. Graph Neural Networks (GNNs) integrate multi-omics data into the interactome’s topology, identifying “network hubs” that exert systemic influence despite low mutation burdens, thus distinguishing true drivers from passengers [111,112]. Complementing this, Transformers and Variational Autoencoders (VAEs) extend discovery to non-coding and regulatory regions [113,114]. By leveraging self-attention mechanisms to decode long-range sequence dependencies and analyzing latent reconstruction errors, these architectures effectively pinpoint dysregulated elements, uncovering functional drivers that remain invisible to traditional single-omics analyses. Recent studies have demonstrated their ability to pinpoint high-confidence drivers in complex multi-omics landscapes. Ma et al. introduced DeepMAPS [115], a framework utilizing a heterogeneous graph transformer to infer biological networks from single-cell multi-omics data (e.g., CITE-seq and scRNA-ATAC-seq). By jointly embedding cells and genes within a unified graph, the model modeled interpretable cell–gene relations, enabling the precise identification of 13 distinct cell types in lung tumor environments. Crucially, it dynamically integrated chromatin accessibility with gene expression to uncover specific transcription factors (TFs) driving the development states of diffuse large B-cell lymphoma (DSLL), validating these regulatory drivers as potential immuno-therapeutic targets. In parallel, Yang et al. developed Trans-Driver [116], a deep learning model featuring a novel transformer architecture enhanced with kernel-based multi-head self-attention and Dynamic Tanh (DyT) normalization. This design allowed the model to robustly integrate heterogeneous multi-omics features, capturing subtle non-linear associations often missed by standard methods. Applied to TCGA datasets, Trans-Driver identified 269 candidate driver genes with a remarkable 49.1% match rate against the gold-standard Cancer Gene Census (CGC), proving that integrating multi-omics data via advanced attention mechanisms significantly outperforms methods relying solely on somatic mutations. More recently, Huang et al. proposed MOGOLA [117], a supervised multi-omics integration framework that synergizes Graph Convolutional Networks (GCNs) and Graph Attention Networks (GATs) with an Omics-Linked Attention mechanism to optimize feature representation. Crucially, in the analysis of BRCA, the identified biomarkers demonstrated strong alignment with clinically significant pathways and actionable drug targets, thereby reinforcing the rationale for subtype-specific interventions, including HER2-directed therapies. Specifically, the model highlighted dysregulated proliferation characterized by enrichment in the cell cycle and PI3K–Akt signaling pathways. Furthermore, PPI analysis pinpointed key hub genes, including CCNA2, CDK1, and ESR1, which align with established cancer hallmarks and known drug targets. These results corroborate that MOGOLA effectively captures underlying disease-driving mechanisms rather than mere statistical correlations.
Expanding beyond oncology, Dong et al. developed Omicsformer [118], a deep learning framework that integrates transcriptomic, proteomic, and metabolomic data with routine blood analysis for chronic disease risk prediction. The model successfully identified critical molecular drivers across diverse pathological conditions. Specifically, it highlighted HEXIM in cardiac hypertrophy, TG and PC metabolites in cardiovascular and liver diseases, and FOXO3 in Alzheimer’s disease (AD) pathogenesis. These findings validate the model’s capacity to uncover molecular targets regulating oxidative stress, apoptosis, and metabolic reprogramming, underscoring the value of multi-omics integration in systemic risk stratification. Focusing on neurodegenerative pathology, Xie et al. proposed TransFuse [119], a deep fusion model that mimics the dynamic information flow from DNA to RNA and proteins to unravel the molecular mechanisms of AD. By reconstructing interpretable multi-omic sub-networks, the model identified cohesive functional modules that link genetic risk factors to downstream pathology. Notably, it mapped the interaction between the primary AD risk factor APOE and the transcription factor EGR1, while simultaneously capturing the MAPT gene and tau_PHF1_S404 peptide, confirming the critical role of tau phosphorylation in neurofibrillary tangle formation. Beyond established markers, TransFuse elucidated a complex inflammatory axis involving APP, CD44, and EGR1, and highlighted the functional connectivity between the ANGPT2 gene and hub peptide PIK3R1, implicating neuroinflammation and blood–brain barrier integrity in disease progression. These findings were further validated through eQTL analysis, which confirmed that the identified Single-Nucleotide Polymorphisms (SNPs) exert tissue-specific effects in the frontal cortex. Moreover, pathway enrichment revealed significant crosstalk between VEGF and EPH signaling, suggesting that the dysregulation of angiogenesis and synaptic maintenance pathways acts as a fundamental driver of AD pathology. Addressing cardiovascular health, Luo et al. developed CardiOmicScore [120], a multitask deep learning framework for personalized risk assessment of six common cardiovascular diseases (CVDs). Using UK Biobank data, the model utilizes MetNet and ProNet to profile 168 metabolites and 2920 proteins, capturing complex non-linear interactions often missed by traditional models. The resulting MetScore and ProScore are robust predictors that enhance risk stratification up to 15 years prior to disease onset when integrated with clinical factors. Furthermore, the framework identifies critical CVD-related molecular pathways and biomarkers for conditions like heart failure and stroke, demonstrating the capacity of multitask learning to uncover targets for primary prevention and precision medicine. Collectively, These findings demonstrate that by decoding complex, non-linear interactions within multi-omics data, deep learning frameworks can effectively illuminate functional drivers that remain invisible to traditional single-omics analyses.
4.2. Synthetic Lethality Prediction for “Undruggable” Targets
Synthetic lethality (SL) offers a pivotal strategy for targeting historically “undruggable” malignancies by exploiting genotype-specific survival vulnerabilities. By inhibiting synergistic lethal partners, SL induces cell death exclusively in mutant contexts while preserving healthy tissue. DL has fundamentally reshaped this landscape, transitioning from labor-intensive screenings to data-driven computational inference. Unlike traditional methods limited to linear associations, DL frameworks integrate heterogeneous multi-omics data—spanning mutations and transcriptomics—to reconstruct complex genetic dependencies. Central to this innovation are architectures like Graph Neural Networks (GNNs), which map the global topology of molecular interactions. By propagating biological signals across multi-modal networks, these models unveil latent functional connections invisible to reductionist approaches, thereby accelerating the discovery of robust targets for precision oncology [121,122]. Recently, several studies have explored this direction. For example, Fan et al. proposed MLEC-iSL [123], a framework that introduces the concept of “SL connectivity” as an intermediate learning objective. By incorporating a Graph Transformer to capture long-range dependencies, the model bridged the gap between computation and wet-lab validation: a purposely designed CRISPR-Cas9 double-knockout (CDKO) experiment guided by the model achieved a confirmed synthetic lethality rate of 46.8% among the predicted candidates, vastly outperforming the 7.2% success rate of unguided screens. This high concordance confirms that modeling the global connectivity of genetic networks can effectively pinpoint functional partners for specific genetic backgrounds. Similarly, Lee and Nam developed KG-Slomics [124], a relational graph attention network that addresses the challenge of generalization across different cancer types by integrating an extensively updated knowledge graph with cell line-specific multiomics data. By embedding topological information from biological networks alongside genomic features such as gene expression and mutations, the model dynamically captures context-dependent interactions that static networks often miss. KG-SLomics demonstrated superior predictive performance over existing baselines and successfully identified novel therapeutic targets, such as the synthetic lethal relationship between TP53 and PDGFRB, which was further corroborated through patient survival analysis and drug response validation. Furthermore, Fan et al. introduced MVGCN-iSL [125], a multi-view graph convolutional network integrating five biological graph features and multi-omics data to predict cell-specific SL. The model employs max pooling and a deep neural network (DNN) for final prediction. Researchers validated MVGCN-iSL on the K562 cell line (100,128 samples; 1523 SL pairs) and the Jurkat cell line (74,691 samples; 373 SL pairs). Results demonstrated strong predictive performance and robust generalization to novel genes, confirming the effectiveness of integrating multiple graph features and multi-omics data for SL prediction. Addressing the need for context-aware predictions, Pu et al. proposed SLWise [126], a deep learning framework designed to identify cell-line specific SL. By incorporating a self-attention module to integrate cell-specific multi-omics data into graph representations, the model captures dynamic gene relationships. In the A375 cell line, SLWise predicted a novel SL interaction between BCL2L2 and WEE1. Further investigation revealed that the knockdown of these genes led to abnormalities in key drivers CNOT9 and RHOA. Notably, given RHOA’s established function in driving tumorigenesis and metastatic dissemination across diverse malignancies, it is widely recognized as a promising candidate for therapeutic intervention. These findings demonstrate the model’s capacity to uncover mechanisms underlying cell-specific therapeutic vulnerabilities. Collectively, these advancements underscore the capability of deep learning to decode complex, context-dependent genetic dependencies, thereby unlocking new avenues for precision therapy in cancers traditionally deemed undruggable.
4.3. Drug Target Prioritization
Identifying putative therapeutic entities is merely the prelude; the definitive challenge lies in prioritizing candidates to select those with the highest clinical probability. Target prioritization represents a sophisticated multi-criteria decision-making process that DL transforms into a “learning-to-rank” task. Functioning as evidence fusion engines, DL frameworks integrate heterogeneous data streams, encompassing high-dimensional multi-omics features, ranging from transcriptomic variations to proteomic stability, alongside complex network topologies. By employing strategies such as adaptive graph learning and attention mechanisms, these models automatically weigh the reliability of diverse evidence chains. This process effectively filters noisy correlations and quantifies the systemic value of candidates, generating a robust, convergent hierarchy of actionable targets optimized for experimental validation. A recent study by Tripathy et al. proposed GNNRAI [127], an explainable graph neural network framework that integrates multi-omics data with prior knowledge to prioritize Alzheimer’s disease (AD) biomarkers. By calculating an integrated Target Risk Score (TRS), the model successfully identified APP, APOE, LGMN, and LTF as top-ranked candidates, all of which fell within the top 2% of scored genes. Crucially, these rankings align with established pathology: APP and APOE are canonical drivers of amyloid-β (Aβ) and tau aggregation, while LTF serves as a predictor of Aβ burden. Furthermore, the identification of LGMN (δ-secretase) highlights the model’s capacity to pinpoint critical enzymes involved in the pathogenic cleavage of both tau and APP, validating its effectiveness in capturing key disease biodomains. Advancing the interpretability of feature prioritization, Elmarakeby et al. developed P-NET [128], a biologically informed deep learning architecture designed to unify diverse molecular data types, including somatic mutations, copy number variations, and gene fusions, within a single predictive framework. Unlike traditional statistical methods that process features in isolation, P-NET assigns differential weights to these inputs based on their predictive utility. The model innovatively embeds hierarchical prior biological knowledge directly into the neural network structure, creating an interpretable computational graph that facilitates both clinical prediction and biological discovery. By visualizing the model’s internal architecture, researchers can decipher the multi-level biological pathways driving disease progression. In the context of castration-resistant prostate cancer (CRPC), P-NET not only corroborated established drivers such as AR, PTEN, TP53, and RB1 but also prioritized MDM4 as a critical therapeutic target. This discovery was experimentally validated, suggesting that MDM4 inhibitors could serve as an effective precision therapy for metastatic patients harboring wild-type TP53. Similarly, Yang et al. developed PI4AD [129], a computational medicine framework designed to prioritize therapeutic targets for AD by bridging the gap between genetic associations and pathway-level biology. PI4AD successfully recovered clinically validated targets within the top 1% of prioritized genes, assigning high rankings to APP (18th), ESR1 (61st), KIT (95th) and PDGFRB (100th), which are targets of drugs in Phase III trials or approved clinical use. Furthermore, the framework utilized artificial neural networks to construct self-organizing prioritization maps that distinguish AD-specific molecular signatures, characterized by cell motility and neurotrophin signaling, from those of comorbid neuropsychiatric disorders. The study further identified a 51-gene pathway crosstalk network, revealing Ras signaling as a central therapeutic hub; notably, the combinatorial removal of Ras-pathway nodes (HRAS, KRAS, NRAS) and BCL2 was shown to disrupt 49.0% of the network’s connectivity, suggesting a novel strategy to target neuroinflammation and synaptic dysfunction beyond traditional amyloid/tau paradigms. To address the complexity of target identification in neurodegenerative disorders, Tsuji et al. developed a computational framework utilizing a deep autoencoder to prioritize putative target genes for AD [130]. By leveraging the non-linear feature extraction capabilities of deep learning, the study successfully identified key pathogenicity-associated genes, including DLG4, EGFR, RAC1, SYK, PTK2B, and SOCS1. Furthermore, the framework bridged the gap between target discovery and therapeutic intervention by inferring promising candidates for drug repurposing based on these prioritized targets. Notably, the model identified compounds such as tamoxifen, bosutinib, and dasatinib as potential therapeutics for AD. Taken together, these studies exemplify the capacity of deep learning to bridge the gap between high-throughput multi-omics data and the identification of high-confidence, translatable therapeutic targets.
4.4. Summary of Representative Studies
This section presents a comprehensive summary of representative studies that have successfully leveraged DL-enabled multi-omics integration for drug target discovery. Table 4 systematically categorizes these pivotal works, detailing the specific Study/Model, the utilized Datasets, and the underlying DL Architectures (e.g., GNNs, Transformers, VAEs). Crucially, we highlight the Key Discovery & Validation for each entry, demonstrating how these computational frameworks translate complex multi-omics data into actionable biological insights. These studies collectively address critical challenges in the field, encompassing the identification of novel disease drivers, synthetic lethality prediction, and the robust prioritization of therapeutic targets.
5. Challenges and Future Perspectives
Despite the transformative potential of DL in target discovery, the progression from in silico validation to clinical utility remains obstructed by fundamental structural barriers. To dismantle these impediments, a paradigm shift is required—one that moves beyond standard algorithms to address the intrinsic limitations of data sparsity and heterogeneity, resolves the crisis of model opacity, and acknowledges that even biologically relevant targets may lack pharmacological tractability, necessitating a holistic evaluation across biological, chemical, and clinical dimensions.
5.1. Data Sparsity and Heterogeneity
Although deep learning has demonstrated strong capability in modeling complex biological systems, its effectiveness in multi-omics integration is fundamentally constrained by the intrinsic sparsity and heterogeneity of high-dimensional data. Consequently, even sophisticated algorithms struggle to extract robust biological signals from such sparse, high-dimensional landscapes, where data incompleteness inevitably degrades downstream predictive accuracy. To address these specific bottlenecks, the field is pivoting toward Generative AI and Large Multi-Modal Models (LMMs) [138,139]. Rather than relying solely on direct feature aggregation, these generative frameworks tackle data sparsity by augmenting limited datasets while simultaneously overcoming heterogeneity through harmonizing disparate omics features into a unified latent space, thereby enabling more stable, generalizable, and robust predictions in data-scarce and highly heterogeneous scenarios.
5.2. The Interpretability Crisis
Compounding the challenge is the inherent opacity of DL models. High predictive accuracy remains insufficient without mechanistic transparency; the lack of robust Explainable AI (XAI) frameworks to distinguish causality from correlation sustains a significant trust gap between computational predictions and biological reality. This opacity compromises clinical trust, as the inability to provide biological justification for predictions creates a barrier to their integration into real-world therapeutic decision-making. To bridge this divide, the field must pivot from correlation-based learning toward causality-aware XAI frameworks [140,141]. This evolution transforms models from opaque “pattern matchers” into “causal inference engines,” enabling the simulation of in silico perturbations to distinguish biologically viable intervention points from mere statistical artifacts.
5.3. Target Druggability and Validation Hurdles
While deep learning models excel at identifying biologically relevant targets, biological significance does not equate to pharmacological tractability. A predicted target may lack a druggable binding pocket, exhibit unfavorable pharmacokinetic properties, or pose safety liabilities due to structural homology [142]. Consequently, the transition from computational nomination to therapeutically viable intervention remains a critical bottleneck. Furthermore, even high-confidence candidates require resource-intensive wet-lab validation where attrition rates remain substantial despite promising in silico metrics [143]. To bridge these gaps, the field must shift toward a holistic paradigm that evaluates candidates within a multidimensional landscape of biological, chemical, and clinical feasibility, thereby enhancing the translational success of AI-driven discovery.
6. Conclusions
In this review, we systematically summarize the major applications of deep learning-driven multi-omics integration in the field of drug target discovery. Deep learning-enabled multi-omics integration represents a highly promising approach for accelerating precise drug target discovery. We present a complete analytical workflow, beginning with an overview of the foundations of multi-omics data and a systematic examination of integration strategies—spanning early, intermediate, and late integration. Furthermore, we discuss the pivotal role of dimensionality reduction and manifold learning in extracting latent dynamics and interpretable features from high-dimensional datasets. This is followed by a detailed discussion of advanced DL architectures, including autoencoders, graph neural networks, convolutional neural networks, and Transformer-based models, and their respective roles in analyzing heterogeneous multi-omics data. Importantly, our analysis emphasizes the practical utility of DL in addressing key tasks in drug target discovery, particularly in the identification of disease-driving factors, the prediction of synthetic lethality, and the prioritization of therapeutic targets.
Despite these advances, structural barriers regarding data sparsity, heterogeneity, model opacity, and challenges in druggability and experimental validation persist. Future progress hinges on the advancement of Generative AI and LMMs to mitigate data scarcity and harmonize heterogeneity, causal XAI to decipher mechanistic logic, and the development of holistic, multidimensional frameworks that integrate biological, chemical, and clinical insights to systematically evaluate the pharmacological feasibility of identified targets. Surmounting these challenges is essential to reconcile in silico predictions with clinical reality, ultimately catalyzing the development of next-generation precision therapeutics.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Ballard J.L. Wang Z. Li W. Shen L. Long Q. Deep Learning-Based Approaches for Multi-Omics Data Integration and Analysis Bio Data Min.2024173810.1186/s 13040-024-00391-z 39358793 PMC 11446004 · doi ↗ · pubmed ↗
- 2Valous N.A. Popp F. Zörnig I. Jäger D. Charoentong P. Graph Machine Learning for Integrated Multi-Omics Analysis Br. J. Cancer 202413120521110.1038/s 41416-024-02706-738729996 PMC 11263675 · doi ↗ · pubmed ↗
- 3Wekesa J.S. Kimwele M. A Review of Multi-Omics Data Integration through Deep Learning Approaches for Disease Diagnosis, Prognosis, and Treatment Front. Genet.202314119908710.3389/fgene.2023.119908737547471 PMC 10398577 · doi ↗ · pubmed ↗
- 4Born J. Manica M. Trends in Deep Learning for Property-Driven Drug Design CMC 2021287862788610.2174/092986732866621072911572834325627 · doi ↗ · pubmed ↗
- 5Ren F. Aliper A. Chen J. Zhao H. Rao S. Kuppe C. Ozerov I.V. Zhang M. Witte K. Kruse C. A Small-Molecule TNIK Inhibitor Targets Fibrosis in Preclinical and Clinical Models Nat. Biotechnol.202543637510.1038/s 41587-024-02143-038459338 PMC 11738990 · doi ↗ · pubmed ↗
- 6Ahmed F. Soomro A.M. Salih A.R.C. Samantasinghar A. Asif A. Kang I.S. Choi K.H. A Comprehensive Review of Artificial Intelligence and Network Based Approaches to Drug Repurposing in COVID-19Biomed. Pharmacother.202215311335010.1016/j.biopha.2022.11335035777222 PMC 9236981 · doi ↗ · pubmed ↗
- 7Hira M.T. Razzaque M.A. Angione C. Scrivens J. Sawan S. Sarker M. Integrated Multi-Omics Analysis of Ovarian Cancer Using Variational Autoencoders Sci. Rep.202111626510.1038/s 41598-021-85285-433737557 PMC 7973750 · doi ↗ · pubmed ↗
- 8Withnell E. Zhang X. Sun K. Guo Y. X Omi VAE: An Interpretable Deep Learning Model for Cancer Classification Using High-Dimensional Omics Data Brief. Bioinform.202122 bbab 31510.1093/bib/bbab 31534402865 PMC 8575033 · doi ↗ · pubmed ↗
