EMFormer: Efficient Multi-Scale Transformer for Accumulative Context Weather Forecasting

Hao Chen; Tao Han; Jie Zhang; Song Guo; Fenghua Ling; Lei Bai

arXiv:2602.01194·cs.CV·May 12, 2026

EMFormer: Efficient Multi-Scale Transformer for Accumulative Context Weather Forecasting

Hao Chen, Tao Han, Jie Zhang, Song Guo, Fenghua Ling, Lei Bai

PDF

1 Repo

TL;DR

EMFormer introduces an efficient multi-scale transformer architecture with a novel training pipeline, significantly improving long-term weather forecasting accuracy and speed while maintaining temporal consistency.

Contribution

The paper proposes EMFormer, a new multi-scale transformer architecture with a unique training pipeline, enhancing long-context weather forecasting and reducing computational costs.

Findings

01

Achieves state-of-the-art long-term weather forecast accuracy.

02

Demonstrates strong generalization on vision benchmarks.

03

Provides a 5.69x speedup over traditional multi-scale modules.

Abstract

Long-term weather forecasting is critical for socioeconomic planning and disaster preparedness. While recent approaches employ finetuning to extend prediction horizons, they remain constrained by the issues of catastrophic forgetting, error accumulation, and high training overhead. To address these limitations, we present a novel pipeline across pretraining, finetuning and forecasting to enhance long-context modeling while reducing computational overhead. First, we introduce an Efficient Multi-scale Transformer (EMFormer) to extract multi-scale features through a single convolution in both training and inference. Based on the new architecture, we further employ an accumulative context finetuning to improve temporal consistency without degrading short-term accuracy. Additionally, we propose a composite loss that dynamically balances different terms via a sinusoidal weighting, thereby…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

chenhao-zju/emformer
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.