TRAJGANR: Trajectory-Centric Urban Multimodal Learning via Geospatially Aligned Neural Representations

Maria Despoina Siampou; Gengchen Mai; Ni Lao; Jinmeng Rao; Neha Arora; Cyrus Shahabi; Shushman Choudhury

arXiv:2605.06990·cs.CV·May 11, 2026

TRAJGANR: Trajectory-Centric Urban Multimodal Learning via Geospatially Aligned Neural Representations

Maria Despoina Siampou, Gengchen Mai, Ni Lao, Jinmeng Rao, Neha Arora, Cyrus Shahabi, Shushman Choudhury

PDF

TL;DR

TrajGANR introduces a trajectory-centric geospatial self-supervised learning framework that aligns continuous movement patterns with static observations, improving urban understanding tasks.

Contribution

It presents a novel neural representation of trajectories enabling fine-grained multimodal alignment, outperforming existing geospatial models.

Findings

01

TrajGANR outperforms existing frameworks on four urban tasks.

02

The proposed MSSL objective significantly improves alignment quality.

03

Fine-grained geospatial alignment is crucial for urban understanding.

Abstract

Multimodal self-supervised learning (MSSL) has emerged as a key paradigm for pretraining geospatial foundation models. However, existing geospatial MSSL methods are mainly designed for static pairs of modalities, such as satellite imagery, street-view imagery, and text, where learning is driven by aligning observations from the same or nearby locations. This assumption breaks down for human mobility trajectories, which represent continuous movement along paths rather than discrete observations at individual locations. Although trajectories are important for urban understanding through their ability to capture human activity across roads, neighborhoods, and places over time, they remain largely underexplored in current geospatial MSSL frameworks. We present TrajGANR, a novel trajectory-centric geospatial MSSL framework that aligns continuous movement patterns with static, location-based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.