Deep Generative Models that Solve PDEs: Distributed Computing for   Training Large Data-Free Models

Sergio Botelho; Ameya Joshi; Biswajit Khara; Soumik Sarkar; Chinmay; Hegde; Santi Adavani; Baskar Ganapathysubramanian

arXiv:2007.12792·cs.LG·July 28, 2020

Deep Generative Models that Solve PDEs: Distributed Computing for Training Large Data-Free Models

Sergio Botelho, Ameya Joshi, Biswajit Khara, Soumik Sarkar, Chinmay, Hegde, Santi Adavani, Baskar Ganapathysubramanian

PDF

TL;DR

This paper introduces a distributed deep learning framework that enables training large-scale neural PDE solvers efficiently, overcoming previous computational limitations and demonstrating practical applicability in scientific computing.

Contribution

The paper presents a scalable software framework for distributed training of large neural PDE models, including novel features like loss integrity and distributed optimization methods.

Findings

01

Framework scales well on cloud and HPC clusters.

02

Distributed higher-order optimization is 2-3x faster than SGD.

03

Neural PDE solvers trained at unprecedented sizes.

Abstract

Recent progress in scientific machine learning (SciML) has opened up the possibility of training novel neural network architectures that solve complex partial differential equations (PDEs). Several (nearly data free) approaches have been recently reported that successfully solve PDEs, with examples including deep feed forward networks, generative networks, and deep encoder-decoder networks. However, practical adoption of these approaches is limited by the difficulty in training these models, especially to make predictions at large output resolutions ( $\geq 1024 \times 1024$ ). Here we report on a software framework for data parallel distributed deep learning that resolves the twin challenges of training these large SciML models - training in reasonable time as well as distributing the storage requirements. Our framework provides several out of the box functionality including (a) loss…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.