Reducing Down(stream)time: Pretraining Molecular GNNs using   Heterogeneous AI Accelerators

Jenna A. Bilbrey; Kristina M. Herman; Henry Sprueill; Soritis S.; Xantheas; Payel Das; Manuel Lopez Roldan; Mike Kraus; Hatem Helal; Sutanay; Choudhury

arXiv:2211.04598·cs.LG·November 10, 2022

Reducing Down(stream)time: Pretraining Molecular GNNs using Heterogeneous AI Accelerators

Jenna A. Bilbrey, Kristina M. Herman, Henry Sprueill, Soritis S., Xantheas, Payel Das, Manuel Lopez Roldan, Mike Kraus, Hatem Helal, Sutanay, Choudhury

PDF

Open Access 1 Repo

TL;DR

This paper demonstrates that using Graphcore IPUs accelerates pretraining of molecular GNNs on large datasets, significantly reducing training time and enabling efficient transfer learning for molecular tasks.

Contribution

It introduces the use of heterogeneous AI accelerators for pretraining molecular GNNs, achieving substantial speedups and enabling practical transfer learning in chemistry.

Findings

01

Training time reduced from 2.7 days to 1.2 hours on large datasets.

02

Finetuning for downstream tasks completed in under 30 minutes.

03

Pretrained models effectively transfer to different molecular tasks.

Abstract

The demonstrated success of transfer learning has popularized approaches that involve pretraining models from massive data sources and subsequent finetuning towards a specific task. While such approaches have become the norm in fields such as natural language processing, implementation and evaluation of transfer learning approaches for chemistry are in the early stages. In this work, we demonstrate finetuning for downstream tasks on a graph neural network (GNN) trained over a molecular database containing 2.7 million water clusters. The use of Graphcore IPUs as an AI accelerator for training molecular GNNs reduces training time from a reported 2.7 days on 0.5M clusters to 1.2 hours on 2.7M clusters. Finetuning the pretrained model for downstream tasks of molecular dynamics and transfer to a different potential energy surface took only 8.3 hours and 28 minutes, respectively, on a single…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

pnnl/downstream_mol_gnn
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Materials Science · Advanced Graph Neural Networks · Topic Modeling

MethodsGraph Neural Network