TL;DR
This paper introduces onion-peel networks for video completion that progressively fill missing regions by leveraging reference images and a novel asymmetric attention mechanism, enabling globally coherent inpainting of large holes.
Contribution
The paper presents a new onion-peel network architecture with asymmetric attention for improved video and image inpainting, capable of handling large holes and utilizing unlimited spatial-temporal context.
Findings
Produces visually pleasing inpainting results
Handles large holes effectively with progressive filling
Applicable to both video and image completion
Abstract
We propose the onion-peel networks for video completion. Given a set of reference images and a target image with holes, our network fills the hole by referring the contents in the reference images. Our onion-peel network progressively fills the hole from the hole boundary enabling it to exploit richer contextual information for the missing regions every step. Given a sufficient number of recurrences, even a large hole can be inpainted successfully. To attend to the missing information visible in the reference images, we propose an asymmetric attention block that computes similarities between the hole boundary pixels in the target and the non-hole pixels in the references in a non-local manner. With our attention block, our network can have an unlimited spatial-temporal window size and fill the holes with globally coherent contents. In addition, our framework is applicable to the image…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
