Reprogramming Vision Foundation Models for Spatio-Temporal Forecasting
Changlu Chen, Yanbin Liu, Chaoxi Niu, Ling Chen, Tianqing Zhu

TL;DR
This paper introduces ST-VFM, a framework that reprograms vision foundation models for spatio-temporal forecasting by integrating dual-branch architecture and specialized reprogramming stages, significantly improving performance on diverse datasets.
Contribution
The paper presents a novel dual-branch reprogramming framework that enables vision foundation models to effectively handle spatio-temporal forecasting tasks without modifying the backbone.
Findings
Outperforms state-of-the-art baselines on ten datasets
Works effectively across multiple VFM backbones like DINO, CLIP, DEIT
Demonstrates robustness and generality of the proposed framework
Abstract
Foundation models have achieved remarkable success in natural language processing and computer vision, demonstrating strong capabilities in modeling complex patterns. While recent efforts have explored adapting large language models (LLMs) for time-series forecasting, LLMs primarily capture one-dimensional sequential dependencies and struggle to model the richer spatio-temporal (ST) correlations essential for accurate ST forecasting. In this paper, we present \textbf{ST-VFM}, a novel framework that systematically reprograms Vision Foundation Models (VFMs) for general-purpose spatio-temporal forecasting. While VFMs offer powerful spatial priors, two key challenges arise when applying them to ST tasks: (1) the lack of inherent temporal modeling capacity and (2) the modality gap between visual and ST data. To address these, ST-VFM adopts a \emph{dual-branch architecture} that integrates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSatellite Image Processing and Photogrammetry · Remote Sensing in Agriculture · Remote Sensing and LiDAR Applications
