HierCVAE: Hierarchical Attention-Driven Conditional Variational Autoencoders for Multi-Scale Temporal Modeling

Yao Wu

arXiv:2508.18922·cs.LG·August 27, 2025

HierCVAE: Hierarchical Attention-Driven Conditional Variational Autoencoders for Multi-Scale Temporal Modeling

Yao Wu

PDF

TL;DR

HierCVAE is a hierarchical attention-based variational autoencoder designed for multi-scale temporal modeling, effectively capturing complex dependencies and uncertainties in systems like energy consumption.

Contribution

It introduces a novel three-tier attention structure combined with conditional VAEs and ResFormer blocks for improved multi-scale temporal prediction.

Findings

01

15-40% improvement in prediction accuracy

02

Superior uncertainty calibration

03

Excels in long-term forecasting and multi-variate dependencies

Abstract

Temporal modeling in complex systems requires capturing dependencies across multiple time scales while managing inherent uncertainties. We propose HierCVAE, a novel architecture that integrates hierarchical attention mechanisms with conditional variational autoencoders to address these challenges. HierCVAE employs a three-tier attention structure (local, global, cross-temporal) combined with multi-modal condition encoding to capture temporal, statistical, and trend information. The approach incorporates ResFormer blocks in the latent space and provides explicit uncertainty quantification via prediction heads. Through evaluations on energy consumption datasets, HierCVAE demonstrates a 15-40% improvement in prediction accuracy and superior uncertainty calibration compared to state-of-the-art methods, excelling in long-term forecasting and complex multi-variate dependencies.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.