# Topological deep learning for enhancing peptide-protein complex prediction

**Authors:** Xuhang Dai, Rui Wang, Yingkai Zhang

PMC · DOI: 10.1038/s42004-025-01727-4 · Communications Chemistry · 2025-11-12

## TL;DR

This paper introduces TopoDockQ and ResidueX to improve peptide-protein complex predictions and design.

## Contribution

TopoDockQ uses topological deep learning to reduce false positives in peptide-protein modeling.

## Key findings

- TopoDockQ reduces false positives by 42% compared to AlphaFold2's confidence score.
- ResidueX enables integration of non-canonical amino acids into peptide scaffolds.
- The tools enhance precision and recall in peptide-protein complex predictions.

## Abstract

Peptide-protein interactions are essential to biological processes and drug discovery, but selecting high-quality models from predicted complexes remains challenging due to high false positive rates (FPR). Here we introduce TopoDockQ, a topological deep learning model leveraging persistent combinatorial Laplacian (PCL) features to predict DockQ scores (p-DockQ) for accurately evaluating peptide-protein interface quality, aimed at enhancing precision and mitigating FPR in model selection. Compared to AlphaFold2’s built-in confidence score, TopoDockQ reduces false positives by at least 42% and increases precision by 6.7% across five evaluation datasets filtered to ≤70% peptide-protein sequence identity, while maintaining relatively high recall and F1 scores. To support flexible peptide design, we introduce ResidueX, a workflow incorporating non-canonical amino acids (ncAA) into peptide scaffolds. Together, TopoDockQ and ResidueX advance peptide-protein modeling by refining confidence scoring and supporting ncAA incorporation, enabling precise, customizable design and accelerating next-generation peptide therapeutics development.

Peptide-protein interactions are crucial for biological processes, yet accurate structural modeling remains challenging. Here, the authors introduce TopoDockQ, a topological deep learning model to enhance model selection, and ResidueX, a workflow for non-canonical amino acids integrating into custom peptide scaffolds, which represent a synergistic advancement in peptide-protein modeling and enhance more precise and versatile peptide design.

## Full-text entities

- **Diseases:** death (MESH:D003643), TDL (MESH:D007859), PCL (MESH:D000088562)
- **Chemicals:** acids (MESH:D000143), O (MESH:D010100), tyrosine (MESH:D014443), C (MESH:D002244), water (MESH:D014867), amino acid (MESH:D000596), gold (MESH:D006046), CA (MESH:D002118), N (MESH:D009584), peptides (MESH:D010455), metal (MESH:D008670), DockQ (-)
- **Cell lines:** ncAA-1 — Mus musculus (Mouse), Hybridoma (CVCL_B6LS)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12612092/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12612092/full.md

## References

9 references — full list in the complete paper: https://tomesphere.com/paper/PMC12612092/full.md

---
Source: https://tomesphere.com/paper/PMC12612092