MelissaDL x Breed: Towards Data-Efficient On-line Supervised Training of Multi-parametric Surrogates with Active Learning
Sofya Dymchenko (DATAMOVE), Abhishek Purandare (DATAMOVE), Bruno, Raffin (DATAMOVE)

TL;DR
This paper introduces Breed, an active learning method that enhances data-efficiency and generalization of neural network surrogates for PDEs by focusing training on challenging parameter space regions, demonstrated on 2D heat PDE.
Contribution
It presents Breed, a novel active learning approach using adaptive importance sampling to improve online surrogate training for multi-parametric PDEs.
Findings
Improved surrogate generalization on 2D heat PDE
Reduced computational overhead in training
Enhanced focus on difficult parameter regions
Abstract
Artificial intelligence is transforming scientific computing with deep neural network surrogates that approximate solutions to partial differential equations (PDEs). Traditional off-line training methods face issues with storage and I/O efficiency, as the training dataset has to be computed with numerical solvers up-front. Our previous work, the Melissa framework, addresses these problems by enabling data to be created "on-the-fly" and streamed directly into the training process. In this paper we introduce a new active learning method to enhance data-efficiency for on-line surrogate training. The surrogate is direct and multi-parametric, i.e., it is trained to predict a given timestep directly with different initial and boundary conditions parameters. Our approach uses Adaptive Multiple Importance Sampling guided by training loss statistics, in order to focus NN training on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsFocus
