Render In-between: Motion Guided Video Synthesis for Action Interpolation
Hsuan-I Ho, Xu Chen, Jie Song, Otmar Hilliges

TL;DR
This paper introduces a motion-guided video synthesis method for action interpolation that generates realistic intermediate frames from low-frame-rate videos using a novel motion model and neural rendering, without needing high-frame-rate training data.
Contribution
It presents a new motion model trained on large-scale motion-capture data and a neural rendering pipeline for high-quality frame interpolation, along with the first high-quality evaluation dataset for this task.
Findings
Outperforms state-of-the-art interpolation methods in quality and accuracy
Produces more realistic human motion and appearance in generated frames
Achieves superior results on pixel-level and distributional metrics
Abstract
Upsampling videos of human activity is an interesting yet challenging task with many potential applications ranging from gaming to entertainment and sports broadcasting. The main difficulty in synthesizing video frames in this setting stems from the highly complex and non-linear nature of human motion and the complex appearance and texture of the body. We propose to address these issues in a motion-guided frame-upsampling framework that is capable of producing realistic human motion and appearance. A novel motion model is trained to inference the non-linear skeletal motion between frames by leveraging a large-scale motion-capture dataset (AMASS). The high-frame-rate pose predictions are then used by a neural rendering pipeline to produce the full-frame output, taking the pose and background consistency into consideration. Our pipeline only requires low-frame-rate videos and unpaired…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Human Pose and Action Recognition · Advanced Image Processing Techniques
