Cross-Identity Motion Transfer for Arbitrary Objects through Pose-Attentive Video Reassembling
Subin Jeon, Seonghyeon Nam, Seoung Wug Oh, Seon Joo Kim

TL;DR
This paper introduces an attention-based network for transferring motion between arbitrary objects in videos, leveraging dense similarity keypoints to reassemble source content for more realistic animation results.
Contribution
It presents a novel attention mechanism and a cross-identity training scheme that improve motion transfer quality across diverse object domains.
Findings
Produces more realistic motion transfer outputs.
Utilizes multiple source observations for improved accuracy.
Outperforms previous methods in visual quality.
Abstract
We propose an attention-based networks for transferring motions between arbitrary objects. Given a source image(s) and a driving video, our networks animate the subject in the source images according to the motion in the driving video. In our attention mechanism, dense similarities between the learned keypoints in the source and the driving images are computed in order to retrieve the appearance information from the source images. Taking a different approach from the well-studied warping based models, our attention-based model has several advantages. By reassembling non-locally searched pieces from the source contents, our approach can produce more realistic outputs. Furthermore, our system can make use of multiple observations of the source appearance (e.g. front and sides of faces) to make the results more accurate. To reduce the training-testing discrepancy of the self-supervised…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Vision and Imaging · Human Pose and Action Recognition
