Adaptive Compact Attention For Few-shot Video-to-video Translation
Risheng Huang, Li Shen, Xuan Wang, Cheng Lin, Hao-Zhi Huang

TL;DR
This paper introduces an adaptive compact attention mechanism for few-shot video-to-video translation that efficiently leverages multiple reference images to produce more realistic and temporally consistent videos, outperforming existing methods.
Contribution
The paper proposes a novel adaptive compact attention model that jointly extracts contextual features from multiple references and includes a reference selection method based on Delaunay Triangulation.
Findings
Superior performance on large-scale datasets
Produces photorealistic, temporally consistent videos
Significant improvements over state-of-the-art methods
Abstract
This paper proposes an adaptive compact attention model for few-shot video-to-video translation. Existing works in this domain only use features from pixel-wise attention without considering the correlations among multiple reference images, which leads to heavy computation but limited performance. Therefore, we introduce a novel adaptive compact attention mechanism to efficiently extract contextual features jointly from multiple reference images, of which encoded view-dependent and motion-dependent information can significantly benefit the synthesis of realistic videos. Our core idea is to extract compact basis sets from all the reference images as higher-level representations. To further improve the reliability, in the inference phase, we also propose a novel method based on the Delaunay Triangulation algorithm to automatically select the resourceful references according to the input…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Video Analysis and Summarization · Advanced Image Processing Techniques
