# Node2Vec-DGI-EL: a hierarchical graph representation learning model for ingredient–disease association prediction

**Authors:** Leifeng Zhang, Xin Dong, Shuaibing Jia, Jianhua Zhang

PMC · DOI: 10.1093/bioadv/vbaf216 · Bioinformatics Advances · 2025-12-10

## TL;DR

This paper introduces a new model for predicting how traditional Chinese medicine ingredients might treat diseases, using advanced graph learning techniques.

## Contribution

The novel contribution is a hierarchical graph representation learning model combining Node2Vec, DGI, and ensemble learning for ingredient–disease association prediction.

## Key findings

- The model achieved an AUC of 0.9987 and an AUPR of 0.9545, outperforming existing methods.
- Case studies confirmed triptonide and methyl ursolate's strong binding energies with disease-related targets.

## Abstract

Traditional Chinese medicine, as an essential component of traditional medicine, contains active ingredients that serve as a crucial source for modern drug development. To explore the potential application value of traditional Chinese medicine ingredients, this study utilizes the complex network formed between herbs, ingredients, targets, and diseases, and proposes an ingredient–disease association prediction model (Node2Vec-DGI-EL) based on hierarchical graph representation learning. The model first utilized Node2Vec to extract node embedding vectors, serving as the initial features for the network nodes. Then, DGI was applied to further refine the node representations, enhancing the model’s expressive power. Finally, an ensemble learning method was integrated to further improve prediction performance.

The proposed model significantly outperformed existing methods, achieving an AUC of 0.9987 and an AUPR of 0.9545. Case studies further validated the reliability of the model’s predictive results. Specifically, triptonide exhibited a binding energy of −9.62 kcal/mol with PGR, a core target of hypertensive retinopathy, while methyl ursolate showed a binding energy of −9.71 kcal/mol with NFE2L2, a core target of colorectal cancer. The Node2Vec-DGI-EL model focuses on traditional Chinese medicine datasets, effectively predicting ingredient–disease associations. It demonstrates significant application value and can assist in drug repositioning and novel drug development.

The code and data are available at https://github.com/wayfarer569/Node2Vec-DGI-EL.

## Linked entities

- **Proteins:** PGR (progesterone receptor), NFE2L2 (NFE2 like bZIP transcription factor 2)
- **Chemicals:** triptonide (PubChem CID 65411), methyl ursolate (PubChem CID 636516)
- **Diseases:** hypertensive retinopathy (MONDO:0006797), colorectal cancer (MONDO:0005575)

## Full-text entities

- **Genes:** PGR (progesterone receptor) [NCBI Gene 5241] {aka NR3C3, PR}, NFE2L2 (NFE2 like bZIP transcription factor 2) [NCBI Gene 4780] {aka IMDDHH, NRF2, Nrf-2}
- **Diseases:** colorectal cancer (MESH:D015179), hypertensive retinopathy (MESH:D058437)
- **Chemicals:** triptonide (MESH:C084079), methyl ursolate (MESH:C454321)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13043271/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13043271/full.md

## References

48 references — full list in the complete paper: https://tomesphere.com/paper/PMC13043271/full.md

---
Source: https://tomesphere.com/paper/PMC13043271