Video Compression for Spatiotemporal Earth System Data
Oscar J. Pellicer-Valero, Cesar Aybar, Gustau Camps Valls

TL;DR
The paper introduces xarrayvideo, a Python library that applies video compression techniques to large Earth system datasets, achieving high compression ratios while preserving data fidelity for scientific analysis.
Contribution
It presents a novel approach of encoding multichannel spatiotemporal datasets as videos using standard codecs, enabling efficient compression and distribution for Earth science applications.
Findings
Achieves up to 250x compression ratios with high PSNR.
Maintains data quality in downstream machine learning tasks.
Reduces dataset sizes significantly without performance loss.
Abstract
Large-scale Earth system datasets, from high-resolution remote sensing imagery to spatiotemporal climate model outputs, exhibit characteristics analogous to those of standard videos. Their inherent spatial, temporal, and spectral redundancies can thus be readily exploited by established video compression techniques. Here, we present xarrayvideo, a Python library for compressing multichannel spatiotemporal datasets by encoding them as videos. Our approach achieves compression ratios of up to 250x while maintaining high fidelity by leveraging standard, well-optimized video codecs through ffmpeg. We demonstrate the library's effectiveness on four real-world multichannel spatiotemporal datasets: DynamicEarthNet (very high resolution Planet images), DeepExtremeCubes (high resolution Sentinel-2 images), ERA5 (weather reanalysis data), and the SimpleS2 dataset (high resolution multichannel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed and Parallel Computing Systems · Computational Physics and Python Applications · Data Management and Algorithms
