Effective Dataset Distillation for Spatio-Temporal Forecasting with Bi-dimensional Compression
Taehyung Kwon, Yeonje Choi, Yeongho Kim, Kijung Shin

TL;DR
This paper introduces STemDist, a novel dataset distillation method for spatio-temporal forecasting that compresses both spatial and temporal dimensions, significantly reducing training time and memory while improving prediction accuracy.
Contribution
STemDist is the first method to jointly compress spatial and temporal dimensions in spatio-temporal datasets, enhancing efficiency and effectiveness of deep learning models.
Findings
Up to 6X faster training
Up to 8X less memory usage
Up to 12% lower prediction error
Abstract
Spatio-temporal time series are widely used in real-world applications, including traffic prediction and weather forecasting. They are sequences of observations over extensive periods and multiple locations, naturally represented as multidimensional data. Forecasting is a central task in spatio-temporal analysis, and numerous deep learning methods have been developed to address it. However, as dataset sizes and model complexities continue to grow in practice, training deep learning models has become increasingly time- and resource-intensive. A promising solution to this challenge is dataset distillation, which synthesizes compact datasets that can effectively replace the original data for model training. Although successful in various domains, including time series analysis, existing dataset distillation methods compress only one dimension, making them less suitable for spatio-temporal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic Prediction and Management Techniques · Time Series Analysis and Forecasting · Human Mobility and Location-Based Analysis
