Scalable Self-Supervised Representation Learning from Spatiotemporal   Motion Trajectories for Multimodal Computer Vision

Swetava Ganguli; C. V. Krishnakumar Iyer; Vipul Pandey

arXiv:2210.03289·cs.CV·October 10, 2022

Scalable Self-Supervised Representation Learning from Spatiotemporal Motion Trajectories for Multimodal Computer Vision

Swetava Ganguli, C. V. Krishnakumar Iyer, Vipul Pandey

PDF

TL;DR

This paper introduces a scalable self-supervised method to learn meaningful geospatial representations from GPS trajectories, improving downstream tasks by capturing spatial connectivity patterns on the Earth's surface.

Contribution

It proposes a novel approach to generate reachability embeddings from GPS data, enabling better geospatial feature representations for multimodal computer vision tasks.

Findings

01

Reachability embeddings outperform baseline pixel representations in geospatial tasks.

02

The method achieves 4-23% performance gains in AUPRC across five tasks.

03

The approach effectively captures spatial connectivity patterns from unlabeled GPS data.

Abstract

Self-supervised representation learning techniques utilize large datasets without semantic annotations to learn meaningful, universal features that can be conveniently transferred to solve a wide variety of downstream supervised tasks. In this work, we propose a self-supervised method for learning representations of geographic locations from unlabeled GPS trajectories to solve downstream geospatial computer vision tasks. Tiles resulting from a raster representation of the earth's surface are modeled as nodes on a graph or pixels of an image. GPS trajectories are modeled as allowed Markovian paths on these nodes. A scalable and distributed algorithm is presented to compute image-like representations, called reachability summaries, of the spatial connectivity patterns between tiles and their neighbors implied by the observed Markovian paths. A convolutional, contractive autoencoder is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsContractive Autoencoder · Greedy Policy Search