# Knowledge-aware contrastive heterogeneous molecular graph learning

**Authors:** Mukun Chen, Jia Wu, Shirui Pan, Fu Lin, Bo Du, Xiuwen Gong, Wenbin Hu

PMC · DOI: 10.1371/journal.pcbi.1013008 · PLOS Computational Biology · 2025-05-12

## TL;DR

This paper introduces a new machine learning framework that improves molecular property predictions and drug interaction modeling by integrating multiple perspectives and external knowledge.

## Contribution

The novel framework KCHML uses contrastive learning with heterogeneous molecular graphs to enhance property prediction and drug interaction analysis.

## Key findings

- KCHML outperforms existing models in molecular property prediction tasks.
- The framework effectively captures complex molecular features through three distinct graph views.
- It improves drug-drug interaction prediction accuracy.

## Abstract

Molecular representation learning is pivotal in predicting molecular properties and advancing drug design. Traditional methodologies, which predominantly rely on homogeneous graph encoding, are limited by their inability to integrate external knowledge and represent molecular structures across different levels of granularity. To address these limitations, we propose a paradigm shift by encoding molecular graphs into heterogeneous structures, introducing a novel framework: Knowledge-aware Contrastive Heterogeneous Molecular Graph Learning. This approach leverages contrastive learning to enrich molecular representations with embedded external knowledge. KCHML conceptualizes molecules through three distinct graph views—molecular, elemental, and pharmacological—enhanced by heterogeneous molecular graphs and a dual message-passing mechanism. This design offers a comprehensive representation for property prediction, as well as for downstream tasks such as drug-drug interaction prediction. Extensive benchmarking demonstrates KCHML’s superiority over state-of-the-art molecular property prediction models, underscoring its ability to capture intricate molecular features.

In the field of drug discovery, predicting molecular properties and drug interactions is crucial for developing new medications and ensuring patient safety. Traditional methods for representing molecular structures often fail to incorporate external knowledge and struggle to capture complex interactions at different levels of detail. To address these limitations, we developed a new framework called Knowledge-aware Contrastive Heterogeneous Molecular Graph Learning (KCHML). Our approach integrates information from three perspectives—molecular structure, elemental relationships, and pharmacological data—using advanced machine learning techniques. This combination allows for a more detailed and accurate representation of molecules, leading to better predictions of molecular properties and drug interactions. By improving how we model and understand molecules, our work has the potential to streamline drug development and reduce the risk of harmful drug interactions, contributing to safer and more effective treatments.

## Full-text entities

- **Diseases:** toxicity (MESH:D064420), DDI (MESH:D000081015), MPNN (MESH:D015441), migraine (MESH:D008881)
- **Chemicals:** O (MESH:D010100), Acetal (MESH:D000080), N (MESH:D009584), Nitrile (MESH:D009570), Ancitabine (MESH:D003504), Nitro (-), Cytarabine (MESH:D003561), chlorine (MESH:D002713), Thymidine (MESH:D013936), benzene (MESH:D001554), Frovatriptan (MESH:C108128), C (MESH:D002244)
- **Species:** Homo sapiens (human, species) [taxon 9606], Human immunodeficiency virus 1 (no rank) [taxon 11676]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12068650/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12068650/full.md

## References

28 references — full list in the complete paper: https://tomesphere.com/paper/PMC12068650/full.md

---
Source: https://tomesphere.com/paper/PMC12068650