PIPE: Physics-Informed Position Encoding for Alignment of Satellite Images and Time Series

Haobo Li; Eunseo Jung; Zixin Chen; Zhaowei Wang; Yueya Wang; Huamin Qu; Alexis Kai Hon Lau

arXiv:2506.14786·cs.LG·June 19, 2025

PIPE: Physics-Informed Position Encoding for Alignment of Satellite Images and Time Series

Haobo Li, Eunseo Jung, Zixin Chen, Zhaowei Wang, Yueya Wang, Huamin Qu, Alexis Kai Hon Lau

PDF

Open Access

TL;DR

This paper introduces PIPE, a physics-informed positional encoding method that enhances the alignment and forecasting accuracy of satellite images and time series data by embedding physical information into vision language models.

Contribution

PIPE is a novel, lightweight encoding scheme that incorporates physical and sequential information into vision language models for improved multimodal time series forecasting.

Findings

01

Achieves state-of-the-art typhoon forecasting accuracy

02

Improves multimodal alignment in satellite image and time series data

03

Demonstrates 12% performance boost over prior methods

Abstract

Multimodal time series forecasting is foundational in various fields, such as utilizing satellite imagery and numerical data for predicting typhoons in climate science. However, existing multimodal approaches primarily focus on utilizing text data to help time series forecasting, leaving the visual data in existing time series datasets untouched. Furthermore, it is challenging for models to effectively capture the physical information embedded in visual data, such as satellite imagery's temporal and geospatial context, which extends beyond images themselves. To address this gap, we propose physics-informed positional encoding (PIPE), a lightweight method that embeds physical information into vision language models (VLMs). PIPE introduces two key innovations: (1) a physics-informed positional indexing scheme for mapping physics to positional IDs, and (2) a variant-frequency positional…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGeophysics and Gravity Measurements