# Harnessing AI to fuse phenotypic signatures for drug target identification: progress in computational modeling

**Authors:** Fengming Chen, Ranran Zhao, Xingxing Han, Huan Li, Zhishu Tang

PMC · DOI: 10.1093/bib/bbag045 · Briefings in Bioinformatics · 2026-02-09

## TL;DR

This review explores how AI and gene expression data can predict drug interactions without needing detailed structural information, offering a new way to identify drug targets and speed up drug discovery.

## Contribution

The paper introduces a novel paradigm for drug target identification by integrating phenotypic and gene expression data using AI models.

## Key findings

- Gene expression-based models enable functional inference of drug interactions without structural data.
- Three primary model types are analyzed: biological network-based, association-based, and multimodal integration approaches.
- The review highlights the potential of these models to reduce experimental costs and advance precision medicine.

## Abstract

Computational models integrating large-scale gene expression profiles provide a powerful approach for predicting multi-target drug interactions (DTIs). Unlike traditional experimental and computational methods that often require detailed structural or target-specific information, gene expression-based models leverage reference transcriptional signatures. This enables functional inference of interactions without explicit structural data, offering a valuable strategy in data-limited scenarios. By incorporating phenotypic information, these models bridge phenotype screening and target prediction, establishing a novel paradigm for target identification. This review introduces and compares current target identification methods, emphasizing the unique advantages of gene expression profiling in DTI prediction. We also outline major public databases and their applications. As an effective hypothesis-generation tools, computational DTI models reduce experimental costs, enhance understanding of multi-target mechanisms, and accelerate drug discovery. We categorize and analyze three primary model types utilizing large-scale gene expression data: biological network-based, association-based, and multimodal integration approaches, discussing their respective strengths and limitations. Key challenges and future directions are also addressed, including data integration, algorithm optimization, and multi-omics fusion, to fully realize the potential of gene expression data in multi-target drug prediction. This review offers comprehensive guidance on advanced tools, databases, and methodologies, enabling novel research paths for unbiased multi-target exploration. By linking phenotype screening with computational analysis, this integrative approach is expected to advance precision medicine, especially in uncovering drug mechanisms in complex diseases, offering promising prospects.

## Full-text entities

- **Genes:** GLI1 (GLI family zinc finger 1) [NCBI Gene 2735] {aka GLI, PAPA8, PPD1}, NCBP2 (nuclear cap binding protein subunit 2) [NCBI Gene 22916] {aka CBC2, CBP20, NIP1, PIG55}, STING1 (stimulator of interferon response cGAMP interactor 1) [NCBI Gene 340061] {aka ERIS, MITA, MPYS, NET23, SAVI, STING}, GRN (granulin precursor) [NCBI Gene 2896] {aka CLN11, FTD2, GEP, GP88, PCDGF, PEPI}, CGAS (cyclic GMP-AMP synthase) [NCBI Gene 115004] {aka C6orf150, D4, MB21D1, h-cGAS}, DLAT (dihydrolipoamide S-acetyltransferase) [NCBI Gene 1737] {aka DLTA, E2, PBC, PDC-E2, PDCE2}, ENPP1 (ectonucleotide pyrophosphatase/phosphodiesterase 1) [NCBI Gene 5167] {aka ARHR2, COLED, M6S1, NPP1, NPPS, PC-1}
- **Diseases:** CKD (MESH:D051436), cancer (MESH:D009369), Comorbid Diseases (MESH:D004194), inflammation (MESH:D007249), tubulointerstitial fibrosis (MESH:D005355), neurological disorders (MESH:D009461), metabolic diseases (MESH:D008659), obesity (MESH:D009765), toxicity (MESH:D064420), colorectal cancer (MESH:D015179), Chinese Medicine (MESH:C562377), breast cancer (MESH:D001943), kidney injury (MESH:D007674)
- **Chemicals:** narciclasine (MESH:C010753), nelfinavir (MESH:D019888), imatinib (MESH:D000068877), raloxifene (MESH:D020849), enoxacin (MESH:D015365), metformin (MESH:D008687), methotrexate (MESH:D008727), sildenafil (MESH:D000068677), Ginkgetin (MESH:C077458), CMap (-), tolbutamide (MESH:D014044), berberine (MESH:D001599), purpurogallin (MESH:C026133), hyperforin (MESH:C001654)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** MCF7 — Homo sapiens (Human), Invasive breast carcinoma of no special type, Cancer cell line (CVCL_0031), PC3 — Homo sapiens (Human), Prostate carcinoma, Cancer cell line (CVCL_0035)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12885100/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12885100/full.md

## References

212 references — full list in the complete paper: https://tomesphere.com/paper/PMC12885100/full.md

---
Source: https://tomesphere.com/paper/PMC12885100