TL;DR
This paper introduces ST-SampleNet, a transformer-based model that effectively captures local and global spatio-temporal relations in city data, improving prediction accuracy while reducing computational costs through innovative sampling and spatial embedding techniques.
Contribution
The paper presents a novel transformer architecture combining CNNs with self-attention, along with a lightweight sampling strategy and spatially constrained embeddings for efficient, accurate spatio-temporal prediction.
Findings
Effective capture of local and global relations in city data
40% reduction in computational costs with minimal performance loss
Demonstrated superior results on three real-world datasets
Abstract
Accurate spatio-temporal prediction is crucial for the sustainable development of smart cities. However, current approaches often struggle to capture important spatio-temporal relationships, particularly overlooking global relations among distant city regions. Most existing techniques predominantly rely on Convolutional Neural Networks (CNNs) to capture global relations. However, CNNs exhibit neighbourhood bias, making them insufficient for capturing distant relations. To address this limitation, we propose ST-SampleNet, a novel transformer-based architecture that combines CNNs with self-attention mechanisms to capture both local and global relations effectively. Moreover, as the number of regions increases, the quadratic complexity of self-attention becomes a challenge. To tackle this issue, we introduce a lightweight region sampling strategy that prunes non-essential regions and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
