ECHOSAT: Estimating Canopy Height Over Space And Time
Jan Pauls, Karsten Schr\"odter, Sven Ligensa, Martin Schwartz, Berkant Turan, Max Zimmer, Sassan Saatchi, Sebastian Pokutta, Philippe Ciais, Fabian Gieseke

TL;DR
ECHOSAT introduces a global, temporally consistent tree height map at 10m resolution, capturing forest dynamics over multiple years using satellite data and a vision transformer model, enhancing carbon monitoring accuracy.
Contribution
This work presents the first global-scale, temporally consistent tree height map that models growth and disturbances over time using multi-sensor satellite data and a specialized vision transformer.
Findings
Improved accuracy over state-of-the-art single-year predictions.
First global-scale height map capturing growth and disturbances.
Model effectively detects forest loss events like fires.
Abstract
Forest monitoring is critical for climate change mitigation. However, existing global tree height maps provide only static snapshots and do not capture temporal forest dynamics, which are essential for accurate carbon accounting. We introduce ECHOSAT, a global and temporally consistent tree height map at 10 m resolution spanning multiple years. To this end, we resort to multi-sensor satellite data to train a specialized vision transformer model, which performs pixel-level temporal regression. A self-supervised growth loss regularizes the predictions to follow growth curves that are in line with natural tree development, including gradual height increases over time, but also abrupt declines due to forest loss events such as fires. Our experimental evaluation shows that our model improves state-of-the-art accuracies in the context of single-year predictions. We also provide the first…
Peer Reviews
Decision·Submitted to ICLR 2026
The paper is clearly structured, written in plain English, with the main text accommodated by illustrative figures, equations, tables, and appendices with additional details. The dataset is carefully curated (App. A.2.1) and the methodology well documented (App A.2.2). Experimental results are cleanly evaluated against existing methods (Sect. 4.3).
The work falls short in major novelties for the ICLR community regarding learning representation methods. While ECHOSAT resembles a valuable dataset for the Earth observation community, the Temporal-Swin-Unet (Sect. 3.2) blends minor adjustments (1x1 patch size, additional layers and skip connections) from existing architecture. The additional *Growth Loss* (Sect. 3.3) is specific to the application of tree height mapping, and resembles a neat, but limited innovation. Unfortuantely, the authors
- **Originality**: The growth loss enforces monotonic height increase and abrupt drops; this is the first global 10m multi-year canopy height map with inherent temporal modeling %%previous work looks at single years only. - **Quality**: The model uses multi-sensor data and sparse GEDI labels for GT. Ablations isolate growth loss impact on held-out GEDI (Table 1, p. 8). - **Clarity**: The paper is very well-written; it's clearly structured, with well-explained methods; outlines explicit contri
- **Clarity**: Equations are unnumbered, making referencing difficult. - **Significance**: Height-to-carbon flux not evaluated -- above-ground biomass (AGB) to CO₂ conversion or flux tower validation would enhance climate impact, i.e. in carbon accounting. - **Originality**: The main novelty lies in the growth loss (Sec 3.3), but a comparison to learned temporal dynamics in *TimeSformer* (Bertasius et al., ICCV 2021) or *EarthFormer* (Gao et al., NeurIPS 2022) would help quantify value the loss
STRENGTHS: I particularly enjoyed the following aspects of the paper: - The paper is well motivated and tackles an incredibly important real-world task that is clearly suited for ML/vision models. - The paper is very well written; it is easy to understand and has a good / intuitive flow. - Part of that is the papers simplicity; the paper (mostly, exceptions below) has a great balance of depth and simplicity; the proposed methodological advancement is simple but quite elegant and well motivated b
SHORTCOMINGS: I have two major concerns with the current draft of the manuscript: - Part of the "growth loss" is the disturbance indicator. The paragraph introducing it is too short and it is unclear how the disturbance indicator motivated? choice of thresholds here seem arbitrary? The authors say that "A disturbance is considered to occur in zref ∈ RY when a) tree height decreased by more than 50% and more than 4 m and b) tree height decreased to less than 10 m within two years." (line 224),
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRemote Sensing and LiDAR Applications · Remote Sensing in Agriculture · Synthetic Aperture Radar (SAR) Applications and Techniques
