WaveDiffusion: Joint Latent Diffusion for Physically Consistent Seismic and Velocity Generation
Yinan Feng, Hanchen Wang, Yinpeng Chen, Luoyuan Zhang, Jeeun Kang, Yixuan Wu, Young Jin Kim, and Youzuo Lin

TL;DR
WaveDiffusion introduces a joint latent diffusion model that generates physically consistent seismic and velocity data by exploring the latent space structure, demonstrating PDE-valid solutions without explicit physics constraints.
Contribution
This work pioneers a generative latent diffusion approach for physically consistent seismic-velocity data, leveraging PDE-guided latent space refinement and sampling techniques.
Findings
Diffusion refines latent vectors into PDE-consistent pairs
Sampling biases toward physics-valid solutions
High-fidelity, diverse data generated on large-scale benchmarks
Abstract
Full Waveform Inversion (FWI) is a critical technique in subsurface imaging, aiming to reconstruct high-resolution subsurface properties from surface measurements. Acoustic FWI involves two physical modalities, seismic waveforms and velocity maps, which are governed by the acoustic wave equation. Prior works primarily focus on the inverse problem, modeling the relationship between seismic and velocity as an image-to-image translation task. In this work, we study their relationship from a generative perspective. Our aim is to explore and characterize the latent space structure, and identify latent vectors that generate seismic-velocity pairs consistent with the governing partial differential equation (PDE). Specifically, we model seismic and velocity data jointly from a shared latent space via a diffusion process. In experiments, we find that diffusion progressively refines arbitrary…
Peer Reviews
Decision·Submitted to ICLR 2026
Writing is mostly clear
- The paper repeatedly claims generated pairs "satisfy the governing PDE" but only measures L2 distance between decoded seismics and finite-difference solutions, which remains non-trivial (0.002) even after diffusion. No rigorous definition of what constitutes "satisfying" the PDE is provided, and the threshold ε in the validity criterion is never specified or justified. - The approach is a straightforward application of VQ-VAE + latent diffusion (Rombach et al. 2022) to paired seismic-velocity
* The proposed method is sound and appears to generally give more physically consistent samples than the original latent space of a FWI network.
* The authors only compare to BigFWI-B and are missing a diffusion-model-based baseline. See InverseBench (Zheng et al. ICLR 2025) for a comparison of state-of-the-art diffusion inverse solvers on the FWI task. According to their findings, DiffPIR (Zhu et al. ICCV 2023) would give the most accurate results. It is important to compare to a diffusion-based baseline since it is hard to tell whether WaveDiffusion is the best way to utilize a diffusion model for this problem. * I struggle to underst
1. The paper provides comprehensive experimental validation, comparing the proposed approach across multiple settings, and visualizing the progressive process, demonstrating its robustness and consistency. 2. The method effectively refines latent representations through a progressive diffusion process, offering new insights into how latent diffusion can enforce physical consistency in high-dimensional scientific data.
1. The approach appears to require retraining when observed geometries differ from those used in training, which may limit its applicability in real-world seismic scenarios with varying acquisition setups. 2. As acknowledged by the authors, the method lacks a theoretical guarantee for PDE satisfaction or convergence to physically consistent solutions. To strengthen the work, the authors should include comparisons or at least a discussion with related diffusion-based inverse modeling studies, s
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSeismic Imaging and Inversion Techniques
