PyAWD: A Library for Generating Large Synthetic Datasets of Acoustic Wave Propagation
Pascal Tribel, Gianluca Bontempi

TL;DR
PyAWD is a Python library that generates large, high-resolution synthetic seismic datasets simulating acoustic wave propagation in complex media, facilitating machine learning applications in earthquake analysis.
Contribution
The paper introduces PyAWD, a novel tool for creating customizable, large-scale synthetic seismic datasets to support ML research in seismology.
Findings
PyAWD successfully generates realistic seismic datasets.
The library aids in epicenter retrieval tasks.
It helps optimize data collection strategies.
Abstract
Seismic data is often sparse and unevenly distributed due to the high costs and logistical challenges associated with deploying physical seismometers, limiting the application of Machine Learning (ML) in earthquake analysis. While simulation methods exist, no tool allows the generation of large datasets containing simulated measurements of the ground motion. To address this gap, we introduce PyAWD, a Python library designed to generate high-resolution synthetic datasets simulating spatio-temporal acoustic wave propagation in both two-dimensional and three-dimensional heterogeneous media. By allowing fine control over parameters such as the wave speed, external forces, spatial and temporal discretization, and media composition, PyAWD enables the creation of ML-scale datasets that capture the complexity of seismic wave behavior. We illustrate the library's potential with an epicenter…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Music and Audio Processing
MethodsLib
