# Benchmarking heterogeneous network-based methods for drug repurposing

**Authors:** Thi Trang Nguyen, Yudi Pawitan, Stefano Calza, Trung Nghia Vu

PMC · DOI: 10.1038/s41540-025-00633-8 · 2025-12-10

## TL;DR

This paper benchmarks ten network-based methods for drug repurposing across eight datasets, finding that OMC performs best and highlighting issues with cross-validation strategies in prior studies.

## Contribution

The study introduces a comprehensive benchmarking framework and two new drug-disease datasets for evaluating heterogeneous network-based drug repurposing methods.

## Key findings

- OMC achieves the highest AUC and AUPR across most datasets.
- NMF-PDR outperforms other NMF-based approaches.
- Previous studies overestimated performance due to flawed cross-validation strategies.

## Abstract

Drug repurposing (DR) has gained significant attention as a cost-effective strategy for identifying new therapeutic uses for existing drugs. Heterogeneous network-based methods are particularly promising because they exploit complex biological interactions. However, comprehensive benchmarking across multiple datasets is still needed to assess their reliability and generalizability. We systematically evaluate ten advanced heterogeneous network-based DR methods across eight datasets, including six publicly available and two newly introduced drug-disease datasets. The methods include (i) matrix factorization: NMF, NMF-PDR, NMF-DR, VDA-GKSBMF, (ii) matrix completion: BNNR, OMC, HGIMC, (iii) recommendation systems: IBCF, LIBMF, and (iv) a deep learning approach: DRDM. Performance is assessed using the area under the receiver operating characteristic (AUC) and precision-recall curve (AUPR). We also analyze the impact of data sparsity and compare findings with previous benchmarking studies. Our results reveal that OMC consistently achieves the highest AUC and AUPR across most datasets. BNNR, DRDM, HGIMC, VDA-GKSBMF, and NMF-PDR, also demonstrate competitive performance, with NMF-PDR outperforming other NMF-based approaches. We find that differences in cross-validation strategies substantially impact reported AUPR values, with previous studies overestimating performance by omitting many negative instances. This work provides a reliable benchmarking framework and new datasets, offering insights for future research in DR.

## Full-text entities

- **Genes:** MBD5 (methyl-CpG binding domain protein 5) [NCBI Gene 55777] {aka C2DELq23.1, DEL2Q23.1, MRD1}, MRD2 (Mental retardation, autosomal dominant 2) [NCBI Gene 106865367]
- **Diseases:** PDR (MESH:C564461), Man (MESH:D016750), Diseases (MESH:D004194), MC (MESH:C535501), NMF (MESH:C538347), Symptoms (MESH:D012816), Rare Diseases (MESH:D035583), Inheritance (MESH:D030342), OMC (MESH:C536030), DR (MESH:D000081015)
- **Chemicals:** ChemS (-)
- **Species:** Homo sapiens (human, species) [taxon 9606], Severe acute respiratory syndrome coronavirus 2 (no rank) [taxon 2697049]

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12804776/full.md

---
Source: https://tomesphere.com/paper/PMC12804776