Coupling streaming AI and HPC ensembles to achieve 100-1000x faster biomolecular simulations
Alexander Brace, Igor Yakushin, Heng Ma, Anda Trifan, Todd Munson, Ian, Foster, Arvind Ramanathan, Hyungro Lee, Matteo Turilli, Shantenu Jha

TL;DR
DeepDriveMD leverages machine learning to significantly accelerate molecular dynamics simulations on large HPC systems, enabling more extensive and faster biomolecular research.
Contribution
This paper introduces DeepDriveMD, a novel framework that couples ML with HPC to achieve 100-1000x speedups in biomolecular simulations, a substantial advancement over existing methods.
Findings
Achieves 100-1000x acceleration in protein folding simulations
Supports large-scale simulations on up to 1020 HPC nodes
Enables exploration of longer time scales and conformational landscapes
Abstract
Machine learning (ML)-based steering can improve the performance of ensemble-based simulations by allowing for online selection of more scientifically meaningful computations. We present DeepDriveMD, a framework for ML-driven steering of scientific simulations that we have used to achieve orders-of-magnitude improvements in molecular dynamics (MD) performance via effective coupling of ML and HPC on large parallel computers. We discuss the design of DeepDriveMD and characterize its performance. We demonstrate that DeepDriveMD can achieve between 100-1000x acceleration for protein folding simulations relative to other methods, as measured by the amount of simulated time performed, while covering the same conformational landscape as quantified by the states sampled during a simulation. Experiments are performed on leadership-class platforms on up to 1020 nodes. The results establish…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Storage Technologies · Genomics and Phylogenetic Studies · Scientific Computing and Data Management
