Scoreformer: A Surrogate Model For Large-Scale Prediction of Docking Scores
\'Alvaro Ciudad, Adri\'an Morales-Pastor, Laura Malo, Isaac, Filella-Merc\`e, Victor Guallar, Alexis Molina

TL;DR
ScoreFormer is a graph transformer model that predicts molecular docking scores with high accuracy and efficiency, significantly improving virtual screening in drug discovery by covering more chemical space and reducing inference time.
Contribution
We introduce ScoreFormer, a novel graph transformer architecture with PNA and LRWPE, achieving superior docking score prediction and faster inference compared to existing models.
Findings
Achieves competitive docking score prediction accuracy.
Reduces inference time by 1.65 times.
Demonstrates robustness across multiple datasets.
Abstract
In this study, we present ScoreFormer, a novel graph transformer model designed to accurately predict molecular docking scores, thereby optimizing high-throughput virtual screening (HTVS) in drug discovery. The architecture integrates Principal Neighborhood Aggregation (PNA) and Learnable Random Walk Positional Encodings (LRWPE), enhancing the model's ability to understand complex molecular structures and their relationship with their respective docking scores. This approach significantly surpasses traditional HTVS methods and recent Graph Neural Network (GNN) models in both recovery and efficiency due to a wider coverage of the chemical space and enhanced performance. Our results demonstrate that ScoreFormer achieves competitive performance in docking score prediction and offers a substantial 1.65-fold reduction in inference time compared to existing models. We evaluated ScoreFormer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMaritime Navigation and Safety
MethodsAttention Is All You Need · Residual Connection · Softmax · Layer Normalization · Laplacian EigenMap · Byte Pair Encoding · Label Smoothing · Adam · Linear Layer · Multi-Head Attention
