Adaptive Compact Attention For Few-shot Video-to-video Translation

Risheng Huang; Li Shen; Xuan Wang; Cheng Lin; Hao-Zhi Huang

arXiv:2011.14695·cs.CV·December 1, 2020

Open Access

TL;DR

This paper introduces an adaptive compact attention mechanism for few-shot video-to-video translation that efficiently leverages multiple reference images to produce more realistic and temporally consistent videos, outperforming existing methods.

Contribution

The paper proposes a novel adaptive compact attention model that jointly extracts contextual features from multiple references and includes a reference selection method based on Delaunay Triangulation.

Findings

01 Superior performance on large-scale datasets

02 Produces photorealistic, temporally consistent videos

03 Significant improvements over state-of-the-art methods

Abstract

This paper proposes an adaptive compact attention model for few-shot video-to-video translation. Existing work in this domain uses only pixel-wise attention features and does not consider the correlations among multiple reference images, which leads to heavy computation but limited performance. We therefore introduce a novel adaptive compact attention mechanism that efficiently extracts contextual features jointly from multiple reference images; the view-dependent and motion-dependent information these features encode can significantly benefit the synthesis of realistic videos. Our core idea is to extract compact basis sets from all the reference images as higher-level representations. To further improve reliability, in the inference phase we also propose a novel method, based on the Delaunay Triangulation algorithm, that automatically selects the resourceful references according to the input…
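The core idea in the abstract, summarizing all reference images into a small basis set and attending to that set instead of to every reference pixel, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the soft-assignment scheme, the random projection `proj` (a stand-in for learned weights), and the helper names `extract_bases` and `compact_attention` are all assumptions made for illustration, and the sketch covers neither the adaptive component nor the Delaunay-based reference selection.

```python
import numpy as np

def softmax(x, axis):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def extract_bases(ref_feats, proj):
    # ref_feats: (N, C) features of every pixel from all references.
    # proj: (C, K) hypothetical projection standing in for learned weights.
    # Each basis is a convex combination of reference pixel features.
    assign = softmax(ref_feats @ proj, axis=0)   # (N, K), columns sum to 1
    return assign.T @ ref_feats                  # (K, C)

def compact_attention(query_feats, bases):
    # Attend from M query pixels to K bases rather than to N reference pixels.
    c = query_feats.shape[1]
    weights = softmax(query_feats @ bases.T / np.sqrt(c), axis=1)  # (M, K)
    return weights @ bases, weights              # (M, C), (M, K)

rng = np.random.default_rng(0)
num_refs, h, w, c, k = 3, 8, 8, 16, 4            # illustrative sizes only
refs = rng.normal(size=(num_refs, h, w, c))
ref_feats = refs.reshape(-1, c)                  # (192, 16): all references jointly
query = rng.normal(size=(h * w, c))              # (64, 16): input-frame features
proj = rng.normal(size=(c, k))

bases = extract_bases(ref_feats, proj)
out, weights = compact_attention(query, bases)
```

With these sizes, pixel-wise attention over all references would need a 64 × 192 score matrix, whereas the sketch computes a 192 × 4 assignment matrix once and a 64 × 4 score matrix per query frame, which is the kind of saving the abstract attributes to compact basis sets.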

Taxonomy

Topics: Generative Adversarial Networks and Image Synthesis · Video Analysis and Summarization · Advanced Image Processing Techniques