# GARD: Genomic Data-Based Drug Repurposing in Head and Neck Cancer with Large Language Model Validation

**Authors:** Pradham Tanikella, William Nenad, Christophe Courtine, Yifan Dai, Qingying Deng, Baiming Zou, Nosayaba Osazuwa-Peters, Travis P. Schrank, Di Wu

PMC · DOI: 10.3390/cancers18050757 · Cancers · 2026-02-26

## TL;DR

This study introduces GARD, a pipeline that uses genomic data and large language models to identify new drugs for head and neck cancer treatment.

## Contribution

The novel GARD pipeline combines genomic analysis, network expansion, and LLM-based literature validation to accelerate drug repurposing for HNC.

## Key findings

- GARD identified known and emerging drugs like Fostamatinib, Quercetin, and Aspirin for HNC treatment.
- Genes like PIK3CA, SOX2, and TP53 were linked to HNC through genomic and network analysis.
- HPV stratification improved the precision of drug repurposing in HNC subgroups.

## Abstract

Head and neck cancer (HNC) is among the most prevalent and challenging cancers worldwide. Developing new drugs is expensive and time consuming, so this study explored a faster, cost-effective approach utilizing existing medications with established safety profiles: drug repurposing. We developed the GARDpipeline (Genomic Alteration-based Repurposing for Drugs), which utilizes large-scale genomic data from The Cancer Genome Atlas (TCGA) to identify key genomic changes in HNC. These genes are expanded through protein–protein interaction networks to capture related pathways and then validated using evidence from thousands of PubMed articles extracted by large language model (LLM) tools. Finally, validated genes are matched with drugs using the DrugBank database. This approach uncovered both known cancer drugs and promising new candidates. These included targeted therapies such as Fostamatinib, Nintedanib, Brigatinib, Regorafenib, and Lenvatinib, as well as emerging compounds like Artenimol, Quercetin, and Acetylsalicylic Acid (Aspirin). Through a combination of genomic analysis, network expansion, and literature validation, the GARD pipeline offers a powerful way to accelerate personalized cancer treatments while reducing cost and development time.

Background/Objectives: Head and neck cancer (HNC) represents the seventh most common cancer diagnosis globally, yet current treatments, including surgery, radiation, and immunotherapy, have shown limited improvement in outcomes. Drug repurposing offers a cost-effective strategy to identify new therapeutic options by leveraging existing medications with known safety profiles. Within this study, we developed the GARD pipeline (Genomic Alteration-based Repurposing for Drugs), designed to uncover repurposing candidates for HNC using genomic and network-based approaches. Methods: GARD integrates multi-omics data from The Cancer Genome Atlas (TCGA), including copy number variation (CNV) and somatic mutations (SOM). The cohort was stratified by human papillomavirus (HPV) status. Risk-associated genes were identified and then expanded via high-confidence protein–protein interaction (PPI) networks. Top candidate genes were filtered through comprehensive analysis of publicly available literature data in PubMed using LLMs to validate the relationship between the identified genes and HNC. The top risk genes and their network-expanded neighbors were mapped against DrugBank, and through statistical significance testing and literature validation, established significant drug–gene associations. Results: Significant genes associated with HNC, inferred by genomics alteration, were identified across HPV-positive and HPV-negative subgroups, such as PIK3CA, SOX2, TP53, EIF4G1, TLR7, CLDN1, PRKCI, and EPHA2. Further expansion through the PPI network identified other targetable genes such as EGFR, ERBB2, and the FGFRs. Literature-based validation efforts ensured confidence in the gene–disease association. Drug–gene mapping revealed candidates spanning those already in clinical trials for HNC (e.g., Afatinib, Cabozantinib, Dasatinib, Brigatinib, Lenvatinib, Capivasertib, and Erdafitinib) and emerging or repurposing candidates (Amuvatinib, XL765 (Voxtalisib), Golotimod, Artenimol, Quercetin, and Acetylsalicylic Acid), offering opportunities for precision repurposing. Conclusions: The GARD pipeline demonstrates a genomics-driven, network-informed framework for systematic drug repurposing in HNC. HPV stratification enhances precision, literature-based validation strengthens confidence, and integrated drug mapping enables refinement of existing therapies and discovery of novel candidates for personalized treatment strategies. Code Availability: The full implementation of the GARD pipeline, including preprocessing scripts, statistical analysis modules, and visualization tools, is publicly available on GitHub.

## Linked entities

- **Genes:** PIK3CA (phosphatidylinositol-4,5-bisphosphate 3-kinase catalytic subunit alpha) [NCBI Gene 5290], SOX2 (SRY-box transcription factor 2) [NCBI Gene 6657], TP53 (tumor protein p53) [NCBI Gene 7157], EIF4G1 (eukaryotic translation initiation factor 4 gamma 1) [NCBI Gene 1981], TLR7 (toll like receptor 7) [NCBI Gene 51284], CLDN1 (claudin 1) [NCBI Gene 9076], PRKCI (protein kinase C iota) [NCBI Gene 5584], EPHA2 (EPH receptor A2) [NCBI Gene 1969], EGFR (epidermal growth factor receptor) [NCBI Gene 1956], ERBB2 (erb-b2 receptor tyrosine kinase 2) [NCBI Gene 2064]
- **Chemicals:** Fostamatinib (PubChem CID 11671467), Nintedanib (PubChem CID 135423438), Brigatinib (PubChem CID 68165256), Regorafenib (PubChem CID 11167602), Lenvatinib (PubChem CID 9823820), Artenimol (PubChem CID 3000518), Quercetin (PubChem CID 5280343), Acetylsalicylic Acid (PubChem CID 2244), Afatinib (PubChem CID 10184653), Cabozantinib (PubChem CID 25102847), Dasatinib (PubChem CID 3062316), Capivasertib (PubChem CID 25227436), Erdafitinib (PubChem CID 67462786), Amuvatinib (PubChem CID 11282283), XL765 (PubChem CID 16123056), Golotimod (PubChem CID 6992140)
- **Diseases:** Head and neck cancer (MONDO:0005627)

## Full-text entities

- **Genes:** SOX2 (SRY-box transcription factor 2) [NCBI Gene 6657] {aka ANOP3, MCOPS3}, EPHA2 (EPH receptor A2) [NCBI Gene 1969] {aka ARCC2, CTPA, CTPP1, CTRCT6, ECK}, PRKCI (protein kinase C iota) [NCBI Gene 5584] {aka DXS1179E, PKCI, nPKC-iota}, CLDN1 (claudin 1) [NCBI Gene 9076] {aka CLD1, ILVASC, SEMP1}, EIF4G1 (eukaryotic translation initiation factor 4 gamma 1) [NCBI Gene 1981] {aka EIF-4G1, EIF4F, EIF4G, EIF4GI, P220, PARK18}, TP53 (tumor protein p53) [NCBI Gene 7157] {aka BCC7, BMFS5, LFS1, P53, TRP53}, EGFR (epidermal growth factor receptor) [NCBI Gene 1956] {aka ERBB, ERBB1, ERRP, HER1, NISBD2, NNCIS}, ERBB2 (erb-b2 receptor tyrosine kinase 2) [NCBI Gene 2064] {aka CD340, HER-2, HER-2/neu, HER2, MLN 19, MLN-19}, TLR7 (toll like receptor 7) [NCBI Gene 51284] {aka IMD74, SLEB17, TLR7-like}, PIK3CA (phosphatidylinositol-4,5-bisphosphate 3-kinase catalytic subunit alpha) [NCBI Gene 5290] {aka CCM4, CLAPO, CLOVE, CWS5, HMH, MCAP}
- **Diseases:** Cancer (MESH:D009369), HNC (MESH:D006258)
- **Chemicals:** Golotimod (MESH:C470075), Amuvatinib (MESH:C521047), Voxtalisib (MESH:C576808), Quercetin (MESH:D011794), Dasatinib (MESH:D000069439), Afatinib (MESH:D000077716), Lenvatinib (MESH:C531958), Cabozantinib (MESH:C558660), Acetylsalicylic Acid (MESH:D001241), Erdafitinib (MESH:C000604580), Artenimol (MESH:C039060), Brigatinib (MESH:C000598580), Capivasertib (MESH:C575618)
- **Species:** Human papillomavirus (species) [taxon 10566]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12984907/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12984907/full.md

## References

96 references — full list in the complete paper: https://tomesphere.com/paper/PMC12984907/full.md

---
Source: https://tomesphere.com/paper/PMC12984907