Improving Signal-to-Noise Ratio of Drug Fragment Screening with Variational Autoencoder
Phyllis Zhang, Minhuan Li, Daniel Keedy, Tamar Skaist Mehlman, Doeke Hekstra

TL;DR
This paper introduces VALDO, a machine learning method that improves the detection of drug fragments binding to proteins by reducing noise in crystallographic data.
Contribution
VALDO uses a variational autoencoder to enhance signal-to-noise ratios in fragment screening, outperforming existing methods.
Findings
VALDO effectively filters out crystal heterogeneity to reveal meaningful ligand binding signals.
VALDO outperforms PanDDA and Cluster4x in detecting and estimating the pose of bound drug fragments.
The method reconstructs an apo state, enabling clearer difference maps for ligand identification.
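The last finding can be illustrated with a minimal sketch. It assumes the reconstructed apo state is available as a per-reflection array of structure-factor amplitudes. The function name `difference_amplitudes` and the toy numbers are hypothetical and are not taken from the VALDO code. In practice the reconstruction would come from the trained variational autoencoder; here it is a placeholder array.

```python
import numpy as np

def difference_amplitudes(observed, reconstructed_apo):
    """Subtract reconstructed apo amplitudes from observed amplitudes.

    Both inputs are 1-D arrays indexed by the same reflections.
    Hypothetical helper for illustration, not part of VALDO.
    """
    observed = np.asarray(observed, dtype=float)
    reconstructed_apo = np.asarray(reconstructed_apo, dtype=float)
    if observed.shape != reconstructed_apo.shape:
        raise ValueError("observed and reconstructed arrays must have the same shape")
    return observed - reconstructed_apo

# Toy data: four reflections from one soaked crystal (made-up values).
observed = np.array([10.0, 20.0, 35.0, 5.0])
# Placeholder for the autoencoder's apo reconstruction of the same crystal.
reconstructed = np.array([10.0, 18.0, 30.0, 5.0])

diff = difference_amplitudes(observed, reconstructed)
```

Reflections where `diff` is far from zero are the ones the apo reconstruction does not explain. Combined with phases, these differences would feed a difference map in which a bound fragment may show up as residual density.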
Abstract
In the quest for new drug candidates, a pivotal phase involves identifying compounds that selectively and robustly bind to their targets to modulate activity for therapeutic effects. This modulation can manifest as inhibition, activation, or allosteric regulation, among others. A core challenge in drug discovery is detecting ligands with high binding affinity to target proteins. Techniques range from high-throughput screening and computational simulations to advanced machine learning models. Fragment-based drug discovery (FBDD), particularly using X-ray crystallography beamlines, has become a prominent method for finding initial leads for small-molecule modulators. This involves soaking potential ligands into protein crystals, followed by X-ray diffraction data analysis to detect binding fragments. Despite technological advancements enhancing throughput, variations in crystals…
Taxonomy
Topics: Computational Drug Discovery Methods · Spectroscopy and Chemometric Analyses
