Parameter Efficient Deep Probabilistic Forecasting
Olivier Sprangers, Sebastian Schelter, Maarten de Rijke

TL;DR
This paper introduces a parameter-efficient Bidirectional Temporal Convolutional Network for probabilistic time series forecasting, achieving comparable accuracy to state-of-the-art models while significantly reducing memory and training costs.
Contribution
The novel BiTCN model combines two TCNs for encoding past and future covariates, requiring fewer parameters than Transformer-based models, with comparable forecasting performance.
Findings
Performs on par with state-of-the-art methods on real datasets
Uses significantly fewer parameters, reducing training time and memory
Maintains accuracy across multiple evaluation metrics
Abstract
Probabilistic time series forecasting is crucial in many application domains such as retail, ecommerce, finance, or biology. With the increasing availability of large volumes of data, a number of neural architectures have been proposed for this problem. In particular, Transformer-based methods achieve state-of-the-art performance on real-world benchmarks. However, these methods require a large number of parameters to be learned, which imposes high memory requirements on the computational resources for training such models. To address this problem, we introduce a novel Bidirectional Temporal Convolutional Network (BiTCN), which requires an order of magnitude less parameters than a common Transformer-based approach. Our model combines two Temporal Convolutional Networks (TCNs): the first network encodes future covariates of the time series, whereas the second network encodes past…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Stock Market Forecasting Methods · Forecasting Techniques and Applications
MethodsDilated Causal Convolution · Mixture of Logistic Distributions · WaveNet
