ESM-NBR: fast and accurate nucleic acid-binding residue prediction via protein language model feature representation and multi-task learning
Wenwu Zeng, Dafeng Lv, Wenjuan Liu, Shaoliang Peng

TL;DR
ESM-NBR is a rapid, sequence-based method that leverages protein language models and multi-task learning to accurately predict nucleic acid-binding residues, outperforming traditional methods in speed and accuracy.
Contribution
The paper introduces ESM-NBR, a novel approach combining protein language model features with multi-task deep learning for improved nucleic acid-binding residue prediction.
Findings
Outperforms HMM-based features in prediction accuracy.
Achieves higher MCC scores than existing methods.
Significantly faster prediction speed, about 16 times quicker.
Abstract
Protein-nucleic acid interactions play a very important role in a variety of biological activities. Accurate identification of nucleic acid-binding residues is a critical step in understanding the interaction mechanisms. Although many computationally based methods have been developed to predict nucleic acid-binding residues, challenges remain. In this study, a fast and accurate sequence-based method, called ESM-NBR, is proposed. In ESM-NBR, we first use the large protein language model ESM2 to extract discriminative biological properties feature representation from protein primary sequences; then, a multi-task deep learning model composed of stacked bidirectional long short-term memory (BiLSTM) and multi-layer perceptron (MLP) networks is employed to explore common and private information of DNA- and RNA-binding residues with ESM2 feature as input. Experimental results on benchmark data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · RNA and protein synthesis mechanisms · Machine Learning in Bioinformatics
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
