Transforming the Use of Earth Observation Data: Exascale Training of a Generative Compression Model with Historical Priors for up to 10,000x Data Reduction

Jinxiao Zhang; Runmin Dong; Xiyong Wu; Xihan Huang; Shenggan Cheng; Yunkai Yang; Zheng Zhou; Yunpu Xu; Zhaoyang Luo; Miao Yang; Fan Wei; Mengxuan Chen; Yang You; Juepeng Zheng; Weijia Li; Yutong Lu; Haohuan Fu

arXiv:2605.08633·cs.DC·May 12, 2026

Transforming the Use of Earth Observation Data: Exascale Training of a Generative Compression Model with Historical Priors for up to 10,000x Data Reduction

Jinxiao Zhang, Runmin Dong, Xiyong Wu, Xihan Huang, Shenggan Cheng, Yunkai Yang, Zheng Zhou, Yunpu Xu, Zhaoyang Luo, Miao Yang, Fan Wei, Mengxuan Chen, Yang You, Juepeng Zheng, Weijia Li, Yutong Lu, Haohuan Fu

PDF

TL;DR

This paper introduces a generative compression framework for Earth observation data that leverages historical archives to achieve 100x to 10,000x data reduction, trained at exascale on a supercomputer.

Contribution

It presents a novel exascale training approach for a generative compression model that uses historical priors to enable extreme data reduction in Earth observation.

Findings

01

Achieved 1.54 EFLOP/s sustained training performance.

02

Enabled 100x to 10,000x data reduction across tasks.

03

Demonstrated the feasibility of historical-prior learning for Earth data compression.

Abstract

Earth observation is becoming one of the largest data-producing activities in science, yet current pipelines still treat compression as a storage and transmission tool rather than a new way to use data. We present a generative compression framework that learns from historical Earth observation archives and enables on-demand 100x to 10,000x data reduction across downstream tasks. Unlike general visual data, Earth observation repeatedly measures the same evolving planet, making historical-prior learning feasible for extreme compression. To realize this paradigm, we train large generative compression models at exascale on the LineShine Armv9 CPU supercomputer, with co-optimization across model design, kernels, memory hierarchy, runtime, and parallelism. Our implementation sustains 1.54 EFLOP/s and peaks at 2.16 EFLOP/s in end-to-end training. This work shows that historical-prior…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.