PepLand: a large-scale pre-trained peptide representation model for a comprehensive landscape of both canonical and non-canonical amino acids
Ruochi Zhang (1,2,3), Haoran Wu (3), Yuting Xiu (3), Kewei Li (1,4),, Ningning Chen (3), Yu Wang (3), Yan Wang (1,2,4), Xin Gao (5,6,7), Fengfeng, Zhou (1,4,7) ((1) Key Laboratory of Symbolic Computation, Knowledge, Engineering of Ministry of Education, Jilin University

TL;DR
PepLand is a novel pre-trained model utilizing a multi-view heterogeneous graph neural network to effectively represent and analyze both canonical and non-canonical peptides, advancing peptide research.
Contribution
Introduces PepLand, a comprehensive pre-training architecture specifically designed for complex peptide sequences with canonical and non-canonical amino acids.
Findings
Effective in predicting peptide properties such as interactions, permeability, and solubility.
Outperforms existing models in capturing structural features of peptides.
Provides publicly available code for reproducibility.
Abstract
In recent years, the scientific community has become increasingly interested on peptides with non-canonical amino acids due to their superior stability and resistance to proteolytic degradation. These peptides present promising modifications to biological, pharmacological, and physiochemical attributes in both endogenous and engineered peptides. Notwithstanding their considerable advantages, the scientific community exhibits a conspicuous absence of an effective pre-trained model adept at distilling feature representations from such complex peptide sequences. We herein propose PepLand, a novel pre-training architecture for representation and property analysis of peptides spanning both canonical and non-canonical amino acids. In essence, PepLand leverages a comprehensive multi-view heterogeneous graph neural network tailored to unveil the subtle structural representations of peptides.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsChemical Synthesis and Analysis · Antimicrobial Peptides and Activities · Machine Learning in Bioinformatics
MethodsGraph Neural Network
