libmolgrid: GPU Accelerated Molecular Gridding for Deep Learning Applications
Jocelyn Sunseri, David Ryan Koes

TL;DR
libmolgrid is a GPU-accelerated library that efficiently represents three-dimensional molecules as multidimensional arrays, supporting advanced data handling and integration with deep learning frameworks for molecular modeling.
Contribution
It introduces a versatile, GPU-accelerated library for 3D molecular representation, data augmentation, and seamless integration with deep learning frameworks, enhancing molecular machine learning workflows.
Findings
Supports complex 3D molecular representations preserving spatial info
Enables efficient data batching and augmentation for deep learning
Provides GPU acceleration for improved performance
Abstract
There are many ways to represent a molecule as input to a machine learning model and each is associated with loss and retention of certain kinds of information. In the interest of preserving three-dimensional spatial information, including bond angles and torsions, we have developed libmolgrid, a general-purpose library for representing three-dimensional molecules using multidimensional arrays. This library also provides functionality for composing batches of data suited to machine learning workflows, including data augmentation, class balancing, and example stratification according to a regression variable or data subgroup, and it further supports temporal and spatial recurrences over that data to facilitate work with recurrent neural networks, dynamical data, and size extensive modeling. It was designed for seamless integration with popular deep learning frameworks, including Caffe,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods · Protein Structure and Dynamics
