An introduction to distributed training of deep neural networks for segmentation tasks with large seismic datasets
Claire Birnie, Haithem Jarraya, Fredrik Hansteen

TL;DR
This paper presents a distributed training approach with data generators for large seismic datasets, enabling training of bigger models without subsampling or size restrictions, significantly reducing training time.
Contribution
It introduces a scalable training method combining data generators and distributed training to handle large seismic datasets and models efficiently.
Findings
Training time reduced by a factor of four.
Able to train on over 750GB of data.
Supports larger models and higher-resolution data.
Abstract
Deep learning applications are drastically progressing in seismic processing and interpretation tasks. However, the majority of approaches subsample data volumes and restrict model sizes to minimise computational requirements. Subsampling the data risks losing vital spatio-temporal information which could aid training whilst restricting model sizes can impact model performance, or in some extreme cases, renders more complicated tasks such as segmentation impossible. This paper illustrates how to tackle the two main issues of training of large neural networks: memory limitations and impracticably large training times. Typically, training data is preloaded into memory prior to training, a particular challenge for seismic applications where data is typically four times larger than that used for standard image processing tasks (float32 vs. uint8). Using a microseismic use case, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
