Inference-time Stochastic Refinement of GRU-Normalizing Flow for Real-time Video Motion Transfer
Tasmiah Haque, Srinjoy Das

TL;DR
This paper introduces a novel inference-time stochastic refinement method for GRU-Normalizing Flows, enhancing diversity and realism in real-time video motion transfer predictions without retraining.
Contribution
It proposes integrating MCMC steps during inference in GRU-NF models, inspired by Stochastic Normalizing Flows, to better explore output space and improve diversity.
Findings
Outperforms GRU-NF in generating diverse, realistic trajectories
Maintains accuracy over longer prediction horizons
Enhances multimodal behavior capture during inference
Abstract
Real-time video motion transfer applications such as immersive gaming and vision-based anomaly detection require accurate yet diverse future predictions to support realistic synthesis and robust downstream decision making under uncertainty. To improve the diversity of such sequential forecasts we propose a novel inference-time refinement technique that combines Gated Recurrent Unit-Normalizing Flows (GRU-NF) with stochastic sampling methods. While GRU-NF can capture multimodal distributions through its integration of normalizing flows within a temporal forecasting framework, its deterministic transformation structure can limit expressivity. To address this, inspired by Stochastic Normalizing Flows (SNF), we introduce Markov Chain Monte Carlo (MCMC) steps during GRU-NF inference, enabling the model to explore a richer output space and better approximate the true data distribution without…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · Generative Adversarial Networks and Image Synthesis · Advanced Vision and Imaging
