Very-Large-Scale GPU-Accelerated Nuclear Gradient of Time-Dependent Density Functional Theory with Tamm-Dancoff Approximation and Range-Separated Hybrid Functionals
Inkoo Kim, Daun Jeong, Leah Weisburn, Alexandra Alexiu, Troy Van, Voorhis, Young Min Rhee, Won-Joon Son, Hyung-Jin Kim, Jinkyu Yim, Sungmin, Kim, Yeonchoo Cho, Inkook Jang, Seungmin Lee, and Dae Sin Kim

TL;DR
This paper introduces a highly efficient multi-GPU implementation for calculating nuclear gradients in large-scale time-dependent density functional theory, enabling complex biological system simulations with high parallel efficiency.
Contribution
It presents a novel GPU-accelerated algorithm for TDDFT nuclear gradients using Tamm-Dancoff approximation and range-separated hybrid functionals, optimized for large-scale systems.
Findings
Achieved over 70% parallel efficiency on 64 GPUs
Successfully computed gradients for a 4353-atom protein system
Demonstrated scalability on high-performance GPU clusters
Abstract
Modern graphics processing units (GPUs) provide an unprecedented level of computing power. In this study, we present a high-performance, multi-GPU implementation of the analytical nuclear gradient for Kohn-Sham time-dependent density functional theory (TDDFT), employing the Tamm-Dancoff approximation (TDA) and Gaussian-type atomic orbitals as basis functions. We discuss GPU-efficient algorithms for the derivatives of electron repulsion integrals and exchange-correlation functionals within the range-separated scheme. As an illustrative example, we calculated the TDA-TDDFT gradient of the S1 state of a full-scale green fluorescent protein with explicit water solvent molecules, totaling 4353 atoms, at the wB97X/def2-SVP level of theory. Our algorithm demonstrates favorable parallel efficiencies on a high-speed distributed system equipped with 256 Nvidia A100 GPUs, achieving >70% with up to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced NMR Techniques and Applications · Matrix Theory and Algorithms
