Boosting Convolutional Neural Networks' Protein Binding Site Prediction Capacity Using SE(3)-invariant transformers, Transfer Learning and Homology-based Augmentation
Daeseok Lee, Jeunghyun Byun, Bonggun Shin

TL;DR
This paper introduces a novel deep learning approach for predicting protein binding sites, leveraging SE(3)-invariant transformers, transfer learning, and homology-based augmentation, achieving state-of-the-art results in both pocket and residue resolution predictions.
Contribution
It presents a new architecture with SE(3)-invariant self-attention, transfer learning between resolutions, and a homology-based augmentation method, advancing protein binding site prediction techniques.
Findings
Outperformed all state-of-the-art baselines in binding site prediction
Demonstrated good performance on a case study with human serum albumin
Model effectively used transfer learning and homology augmentation to improve accuracy
Abstract
Figuring out small molecule binding sites in target proteins, in the resolution of either pocket or residue, is critical in many virtual and real drug-discovery scenarios. Since it is not always easy to find such binding sites based on domain knowledge or traditional methods, different deep learning methods that predict binding sites out of protein structures have been developed in recent years. Here we present a new such deep learning algorithm, that significantly outperformed all state-of-the-art baselines in terms of the both resolutionspocket and residue. This good performance was also demonstrated in a case study involving the protein human serum albumin and its binding sites. Our algorithm included new ideas both in the model architecture and in the training method. For the model architecture, it incorporated SE(3)-invariant geometric self-attention layers that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Protein Structure and Dynamics · Machine Learning in Bioinformatics
