CPE-Pro: A Structure-Sensitive Deep Learning Method for Protein Representation and Origin Evaluation
Wenrui Gou, Wenhui Ge, Yang Tan, Mingchen Li, Guisheng Fan, and Huiqun Yu

TL;DR
CPE-Pro is a deep learning model that accurately determines the origin of protein structures, distinguishing between experimental and predicted structures, and enhances structural representation through a novel structure-sequence encoding.
Contribution
The paper introduces CPE-Pro, a structure-sensitive deep learning approach for protein origin discrimination, and a new structure-sequence based language model that improves structural feature learning.
Findings
CPE-Pro achieves accurate discrimination of protein structure origins.
Structure-sequence encoding enhances protein feature learning.
Preliminary results show improved structural representations over amino acid-based models.
Abstract
Protein structures are important for understanding their functions and interactions. Currently, many protein structure prediction methods are enriching the structure database. Discriminating the origin of structures is crucial for distinguishing between experimentally resolved and computationally predicted structures, evaluating the reliability of prediction methods, and guiding downstream biological studies. Building on works in structure prediction, We developed a structure-sensitive supervised deep learning model, Crystal vs Predicted Evaluator for Protein Structure (CPE-Pro), to represent and discriminate the origin of protein structures. CPE-Pro learns the structural information of proteins and captures inter-structural differences to achieve accurate traceability on four data classes, and is expected to be extended to more. Simultaneously, we utilized Foldseek to encode protein…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Bioinformatics · Gene expression and cancer classification · Genetics, Bioinformatics, and Biomedical Research
