Scalable Self-Supervised Representation Learning from Spatiotemporal Motion Trajectories for Multimodal Computer Vision
Swetava Ganguli, C. V. Krishnakumar Iyer, Vipul Pandey

TL;DR
This paper introduces a scalable self-supervised method to learn meaningful geospatial representations from GPS trajectories, improving downstream tasks by capturing spatial connectivity patterns on the Earth's surface.
Contribution
It proposes a novel approach to generate reachability embeddings from GPS data, enabling better geospatial feature representations for multimodal computer vision tasks.
Findings
Reachability embeddings outperform baseline pixel representations in geospatial tasks.
The method achieves 4-23% performance gains in AUPRC across five tasks.
The approach effectively captures spatial connectivity patterns from unlabeled GPS data.
Abstract
Self-supervised representation learning techniques utilize large datasets without semantic annotations to learn meaningful, universal features that can be conveniently transferred to solve a wide variety of downstream supervised tasks. In this work, we propose a self-supervised method for learning representations of geographic locations from unlabeled GPS trajectories to solve downstream geospatial computer vision tasks. Tiles resulting from a raster representation of the earth's surface are modeled as nodes on a graph or pixels of an image. GPS trajectories are modeled as allowed Markovian paths on these nodes. A scalable and distributed algorithm is presented to compute image-like representations, called reachability summaries, of the spatial connectivity patterns between tiles and their neighbors implied by the observed Markovian paths. A convolutional, contractive autoencoder is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsContractive Autoencoder · Greedy Policy Search
