Flow-based Spatio-Temporal Structured Prediction of Motion Dynamics
Mohsen Zand, Ali Etemad, and Michael Greenspan

TL;DR
MotionFlow is a novel normalizing flows model that effectively captures complex spatio-temporal motion dynamics, enabling probabilistic predictions across various structured tasks.
Contribution
It introduces a new autoregressive normalizing flows approach with conditional priors and masked convolutions for modeling high-dimensional, time-dependent structured data.
Findings
Successfully applied to trajectory and motion prediction tasks.
Outperforms existing models in modeling complex temporal distributions.
Demonstrates flexibility across multiple structured prediction applications.
Abstract
Conditional Normalizing Flows (CNFs) are flexible generative models capable of representing complicated distributions with high dimensionality and large interdimensional correlations, making them appealing for structured output learning. Their effectiveness in modelling multivariates spatio-temporal structured data has yet to be completely investigated. We propose MotionFlow as a novel normalizing flows approach that autoregressively conditions the output distributions on the spatio-temporal input features. It combines deterministic and stochastic representations with CNFs to create a probabilistic neural generative approach that can model the variability seen in high dimensional structured spatio-temporal data. We specifically propose to use conditional priors to factorize the latent space for the time dependent modeling. We also exploit the use of masked convolutions as autoregressive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Generative Adversarial Networks and Image Synthesis · Music and Audio Processing
