An adaptive sampling algorithm for data-generation to build a data-manifold for physical problem surrogate modeling
Chetra Mang, Axel TahmasebiMoradi, David Danan, Mouadh Yagoubi

TL;DR
This paper introduces an adaptive sampling algorithm that iteratively generates input data to better represent the response manifold of physical models, improving surrogate model accuracy over traditional methods like Latin Hypercube Sampling.
Contribution
The novel adaptive sampling algorithm (ASADG) enhances data generation for physical models, leading to more accurate surrogate models by iteratively refining input data based on the manifold's geometry.
Findings
ASADG outperforms LHS in representing the response manifold.
The algorithm achieves better surrogate model accuracy with the same data size.
Efficient data generation for complex physical models demonstrated.
Abstract
Physical models classically involved Partial Differential equations (PDE) and depending of their underlying complexity and the level of accuracy required, and known to be computationally expensive to numerically solve them. Thus, an idea would be to create a surrogate model relying on data generated by such solver. However, training such a model on an imbalanced data have been shown to be a very difficult task. Indeed, if the distribution of input leads to a poor response manifold representation, the model may not learn well and consequently, it may not predict the outcome with acceptable accuracy. In this work, we present an Adaptive Sampling Algorithm for Data Generation (ASADG) involving a physical model. As the initial input data may not accurately represent the response manifold in higher dimension, this algorithm iteratively adds input data into it. At each step the barycenter of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Gaussian Processes and Bayesian Inference · Advanced Multi-Objective Optimization Algorithms
MethodsFocus
