Adaptive Residue-wise Profile Fusion for Low Homologous Protein SecondaryStructure Prediction Using External Knowledge
Qin Wang, Jun Wei, Boyuan Wang, Zhen Li1, Sheng Wang, Shuguang Cu

TL;DR
This paper introduces an adaptive residue-wise profile fusion method utilizing external self-supervised BERT features to improve protein secondary structure prediction for low homologous proteins, outperforming existing methods.
Contribution
It proposes a novel residue-wise attention mechanism and feature consistency loss to effectively fuse profile and BERT features, enhancing prediction accuracy in low homologous scenarios.
Findings
Outperforms state-of-the-art methods by 4.7% on BC40 dataset for low homologous proteins.
Demonstrates the superiority of profile over PSSM in low homologous PSSP.
Validates the effectiveness of BERT-based pseudo profiles and adaptive fusion in improving prediction accuracy.
Abstract
Protein secondary structure prediction (PSSP) is essential for protein function analysis. However, for low homologous proteins, the PSSP suffers from insufficient input features. In this paper, we explicitly import external self-supervised knowledge for low homologous PSSP under the guidance of residue-wise profile fusion. In practice, we firstly demonstrate the superiority of profile over Position-Specific Scoring Matrix (PSSM) for low homologous PSSP. Based on this observation, we introduce the novel self-supervised BERT features as the pseudo profile, which implicitly involves the residue distribution in all native discovered sequences as the complementary features. Further-more, a novel residue-wise attention is specially designed to adaptively fuse different features (i.e.,original low-quality profile, BERT based pseudo profile), which not only takes full advantage of each feature…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Bioinformatics · Protein Structure and Dynamics · RNA and protein synthesis mechanisms
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · WordPiece · Weight Decay · Softmax · Dense Connections · Adam · Layer Normalization
