MQTransformer: Multi-Horizon Forecasts with Context Dependent and   Feedback-Aware Attention

Carson Eisenach; Yagna Patel; Dhruv Madeka

arXiv:2009.14799·cs.LG·January 28, 2022·5 cites

MQTransformer: Multi-Horizon Forecasts with Context Dependent and Feedback-Aware Attention

Carson Eisenach, Yagna Patel, Dhruv Madeka

PDF

Open Access

TL;DR

This paper introduces MQTransformer, a neural forecasting model that leverages Transformer-inspired attention mechanisms and novel positional encoding to improve probabilistic demand prediction accuracy and reduce forecast variability.

Contribution

It proposes a new decoder-encoder attention, context-dependent positional encoding, and a decoder-self attention scheme, advancing neural forecasting methods.

Findings

01

Improved forecasting accuracy with context-aware attention.

02

Reduced forecast variability through self-attention mechanisms.

03

Enhanced modeling of seasonality and holiday effects.

Abstract

Recent advances in neural forecasting have produced major improvements in accuracy for probabilistic demand prediction. In this work, we propose novel improvements to the current state of the art by incorporating changes inspired by recent advances in Transformer architectures for Natural Language Processing. We develop a novel decoder-encoder attention for context-alignment, improving forecasting accuracy by allowing the network to study its own history based on the context for which it is producing a forecast. We also present a novel positional encoding that allows the neural network to learn context-dependent seasonality functions as well as arbitrary holiday distances. Finally we show that the current state of the art MQ-Forecaster (Wen et al., 2017) models display excess variability by failing to leverage previous errors in the forecast to improve accuracy. We propose a novel…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTime Series Analysis and Forecasting · Traffic Prediction and Management Techniques · Stock Market Forecasting Methods

MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Dense Connections · Dropout · Layer Normalization · Byte Pair Encoding · Label Smoothing · Multi-Head Attention · Attention Is All You Need