Enhancing Spatiotemporal Prediction Model using Modular Design and Beyond
Haoyu Pan, Hao Wu, Tan Yang

TL;DR
This paper introduces a modular approach to spatiotemporal prediction that separates spatial encoding from future prediction, leading to improved performance and state-of-the-art results on benchmark datasets.
Contribution
It proposes a novel modular design decomposing the model into spatial encoder-decoder and predictor modules, reducing complexity and enhancing convergence.
Findings
Achieved state-of-the-art results on KTH-Action and MovingMNIST datasets.
Improved computational efficiency compared to traditional joint models.
Demonstrated robustness and accuracy of the modular approach.
Abstract
Predictive learning uses a known state to generate a future state over a period of time. It is a challenging task to predict spatiotemporal sequence because the spatiotemporal sequence varies both in time and space. The mainstream method is to model spatial and temporal structures at the same time using RNN-based or transformer-based architecture, and then generates future data by using learned experience in the way of auto-regressive. The method of learning spatial and temporal features simultaneously brings a lot of parameters to the model, which makes the model difficult to be convergent. In this paper, a modular design is proposed, which decomposes spatiotemporal sequence model into two modules: a spatial encoder-decoder and a predictor. These two modules can extract spatial features and predict future data respectively. The spatial encoder-decoder maps the data into a latent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Human Mobility and Location-Based Analysis · Geographic Information Systems Studies
