Transfer Learning on Multi-Fidelity Data
Dong H. Song, Daniel M. Tartakovsky

TL;DR
This paper introduces a multi-fidelity transfer learning approach using CNNs to efficiently train surrogates for complex PDE systems, significantly reducing computational costs while maintaining accuracy.
Contribution
It develops a multi-fidelity training strategy guided by multilevel Monte Carlo theory, optimizing data use for CNN surrogates of nonlinear PDEs with uncertain parameters.
Findings
Multi-fidelity data improves training efficiency.
Optimal data mixture balances speed and accuracy.
Method outperforms high-fidelity only training and Monte Carlo solutions.
Abstract
Neural networks (NNs) are often used as surrogates or emulators of partial differential equations (PDEs) that describe the dynamics of complex systems. A virtually negligible computational cost of such surrogates renders them an attractive tool for ensemble-based computation, which requires a large number of repeated PDE solves. Since the latter are also needed to generate sufficient data for NN training, the usefulness of NN-based surrogates hinges on the balance between the training cost and the computational gain stemming from their deployment. We rely on multi-fidelity simulations to reduce the cost of data generation for subsequent training of a deep convolutional NN (CNN) using transfer learning. High- and low-fidelity images are generated by solving PDEs on fine and coarse meshes, respectively. We use theoretical results for multilevel Monte Carlo to guide our choice of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
