Extreme Acceleration of Graph Neural Network-based Prediction Models for Quantum Chemistry
Hatem Helal, Jesun Firoz, Jenna Bilbrey, Mario Michael Krell, Tom, Murray, Ang Li, Sotiris Xantheas, Sutanay Choudhury

TL;DR
This paper presents a hardware-software co-design approach that significantly accelerates graph neural network training for molecular property prediction, reducing training time from days to under two hours.
Contribution
It introduces a novel batching algorithm and implementation on Graphcore IPUs, enabling efficient training of GNNs on large molecular datasets.
Findings
Training time reduced from days to under two hours
Improved throughput by eliminating redundant computation
Effective scaling across diverse molecular graph datasets
Abstract
Molecular property calculations are the bedrock of chemical physics. High-fidelity \textit{ab initio} modeling techniques for computing the molecular properties can be prohibitively expensive, and motivate the development of machine-learning models that make the same predictions more efficiently. Training graph neural networks over large molecular databases introduces unique computational challenges such as the need to process millions of small graphs with variable size and support communication patterns that are distinct from learning over large graphs such as social networks. This paper demonstrates a novel hardware-software co-design approach to scale up the training of graph neural networks for molecular property prediction. We introduce an algorithm to coalesce the batches of molecular graphs into fixed size packs to eliminate redundant computation and memory associated with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods · Advanced Graph Neural Networks
