TL;DR
This paper introduces a domain-agnostic, data-driven framework called Adversarial Distribution Alignment (ADA) that uses generative models to bridge the gap between simulation and experimental data, validated across various scientific datasets.
Contribution
The paper presents ADA, a novel adversarial training method that aligns generative models trained on simulations with real experimental observations, recovering target distributions even with multiple correlated observables.
Findings
Successfully aligns generative models with experimental data across multiple domains.
Proves that ADA recovers the target observable distribution under certain conditions.
Empirically validated on synthetic, molecular, and protein experimental data.
Abstract
A fundamental challenge in science and engineering is the simulation-to-experiment gap. While we often possess prior knowledge of physical laws, these physical laws can be too difficult to solve exactly for complex systems. Such systems are commonly modeled using simulators, which impose computational approximations. Meanwhile, experimental measurements more faithfully represent the real world, but experimental data typically consists of observations that only partially reflect the system's full underlying state. We propose a data-driven distribution alignment framework that bridges this simulation-to-experiment gap by pre-training a generative model on fully observed (but imperfect) simulation data, then aligning it with partial (but real) observations of experimental data. While our method is domain-agnostic, we ground our approach in the physical sciences by introducing Adversarial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
