{\lambda}-Scaled-Attention: A Novel Fast Attention Mechanism for   Efficient Modeling of Protein Sequences

Ashish Ranjan; Md Shah Fahad; Akshay Deepak

arXiv:2201.02912·cs.LG·August 29, 2022

{\lambda}-Scaled-Attention: A Novel Fast Attention Mechanism for Efficient Modeling of Protein Sequences

Ashish Ranjan, Md Shah Fahad, Akshay Deepak

PDF

TL;DR

This paper introduces a {1}-scaled attention mechanism that improves the efficiency and effectiveness of protein sequence modeling, addressing issues like vanishing scores and high distribution variance, leading to better protein function prediction.

Contribution

The paper proposes a novel {1}-scaled attention technique specifically designed for protein sequences, enhancing performance and training efficiency over standard attention methods.

Findings

01

Significant F1 score improvements over standard attention and ProtVecGen-Plus.

02

Faster convergence, halving training epochs needed.

03

Lower training-validation loss gap indicating efficient learning.

Abstract

Attention-based deep networks have been successfully applied on textual data in the field of NLP. However, their application on protein sequences poses additional challenges due to the weak semantics of the protein words, unlike the plain text words. These unexplored challenges faced by the standard attention technique include (i) vanishing attention score problem and (ii) high variations in the attention distribution. In this regard, we introduce a novel {\lambda}-scaled attention technique for fast and efficient modeling of the protein sequences that addresses both the above problems. This is used to develop the {\lambda}-scaled attention network and is evaluated for the task of protein function prediction implemented at the protein sub-sequence level. Experiments on the datasets for biological process (BP) and molecular function (MF) showed significant improvements in the F1 score…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.