# Improving Identification of Drug-Target Binding Sites Based on Structures of Targets Using Residual Graph Transformer Network

**Authors:** Shuang-Qing Lv, Xin Zeng, Guang-Peng Su, Wen-Feng Du, Yi Li, Meng-Liang Wen

PMC · DOI: 10.3390/biom15020221 · Biomolecules · 2025-02-03

## TL;DR

This paper introduces RGTsite, a new deep learning model that improves the identification of drug-target binding sites using a residual graph transformer network.

## Contribution

The novel Residual Graph Transformer Network (RGTsite) improves drug-target binding site prediction by fusing sequence and structural features with better performance than existing methods.

## Key findings

- RGTsite outperformed state-of-the-art methods in F1-score and MCC metrics on multiple benchmark datasets.
- Interpretability analysis confirmed RGTsite's effectiveness in identifying drug-target binding sites in real-world cases.

## Abstract

Improving identification of drug-target binding sites can significantly aid in drug screening and design, thereby accelerating the drug development process. However, due to challenges such as insufficient fusion of multimodal information from targets and imbalanced datasets, enhancing the performance of drug-target binding sites prediction models remains exceptionally difficult. Leveraging structures of targets, we proposed a novel deep learning framework, RGTsite, which employed a Residual Graph Transformer Network to improve the identification of drug-target binding sites. First, a residual 1D convolutional neural network (1D-CNN) and the pre-trained model ProtT5 were employed to extract the local and global sequence features from the target, respectively. These features were then combined with the physicochemical properties of amino acid residues to serve as the vertex features in graph. Next, the edge features were incorporated, and the residual graph transformer network (GTN) was applied to extract the more comprehensive vertex features. Finally, a fully connected network was used to classify whether the vertex was a binding site. Experimental results showed that RGTsite outperformed the existing state-of-the-art methods in key evaluation metrics, such as F1-score (F1) and Matthews Correlation Coefficient (MCC), across multiple benchmark datasets. Additionally, we conducted interpretability analysis for RGTsite through the real-world cases, and the results confirmed that RGTsite can effectively identify drug-target binding sites in practical applications.

## Full-text entities

- **Genes:** CLEC3B (C-type lectin domain family 3 member B) [NCBI Gene 7123] {aka MCDR4, TN, TNA}
- **Diseases:** TN (MESH:C579935), injury to people or property (MESH:C000719191)
- **Chemicals:** amino acid (MESH:D000596), ATP (MESH:D000255), PATP- (MESH:C045797), GTN (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** PATP-NW30-103 — Rattus norvegicus (Rat), Spontaneously immortalized cell line (CVCL_5219), 1OJL-E — Mus musculus (Mouse), Malignant neoplasms of the mouse mammary gland, Cancer cell line (CVCL_L871), 2Z08-A. — Homo sapiens (Human), Melanoma, Cancer cell line (CVCL_C6NJ)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11853427/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11853427/full.md

## References

50 references — full list in the complete paper: https://tomesphere.com/paper/PMC11853427/full.md

---
Source: https://tomesphere.com/paper/PMC11853427