PSC-CPI: Multi-Scale Protein Sequence-Structure Contrasting for Efficient and Generalizable Compound-Protein Interaction Prediction
Lirong Wu, Yufei Huang, Cheng Tan, Zhangyang Gao, Bozhen Hu, Haitao, Lin, Zicheng Liu, Stan Z. Li

TL;DR
PSC-CPI introduces a multi-scale contrastive learning framework that jointly models protein sequences and structures at various scales, improving compound-protein interaction prediction especially in challenging scenarios with unseen data or missing modalities.
Contribution
The paper presents a novel multi-scale contrastive framework for CPI prediction that effectively models intra- and cross-modality dependencies across multiple scales, enhancing generalization and robustness.
Findings
PSC-CPI outperforms existing methods in all test settings.
The model maintains strong performance with missing modality data.
Multi-scale contrastive learning improves prediction accuracy in complex scenarios.
Abstract
Compound-Protein Interaction (CPI) prediction aims to predict the pattern and strength of compound-protein interactions for rational drug discovery. Existing deep learning-based methods utilize only the single modality of protein sequences or structures and lack the co-modeling of the joint distribution of the two modalities, which may lead to significant performance drops in complex real-world scenarios due to various factors, e.g., modality missing and domain shifting. More importantly, these methods only model protein sequences and structures at a single fixed scale, neglecting more fine-grained multi-scale information, such as those embedded in key protein fragments. In this paper, we propose a novel multi-scale Protein Sequence-structure Contrasting framework for CPI prediction (PSC-CPI), which captures the dependencies between protein sequences and structures through both…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBioinformatics and Genomic Networks · Microbial Metabolic Engineering and Bioproduction · Machine Learning in Bioinformatics
