HYouTube: Video Harmonization Dataset

Xinyuan Lu; Shengyuan Huang; Li Niu; Wenyan Cong; Liqing Zhang

arXiv:2109.08809 · cs.CV · September 21, 2021

TL;DR

This paper introduces HYouTube, a new dataset for video harmonization, addressing the lack of public datasets and facilitating research on adjusting foregrounds in composite videos for better visual consistency.

Contribution

The paper presents HYouTube, a public video harmonization dataset built for a task that previously had none. It contains synthetic composite videos, made by adjusting the foreground of real videos, together with 100 real composite videos made via copy-and-paste.

Findings

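01 Synthetic composite videos, created by adjusting the foreground of real videos, provide data for training video harmonization models.

02 Because synthetic and real composite videos differ (a domain gap), the dataset also includes 100 real composite videos created via copy-and-paste.

03 The dataset is publicly released and can serve as a benchmark for future video harmonization research.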

Abstract

Video composition aims to generate a composite video by combining the foreground of one video with the background of another video, but the inserted foreground may be incompatible with the background in terms of color and illumination. Video harmonization aims to adjust the foreground of a composite video to make it compatible with the background. So far, video harmonization has only received limited attention and there is no public dataset for video harmonization. In this work, we construct a new video harmonization dataset HYouTube by adjusting the foreground of real videos to create synthetic composite videos. Considering the domain gap between real composite videos and synthetic composite videos, we additionally create 100 real composite videos via copy-and-paste. Datasets are available at https://github.com/bcmi/Video-Harmonization-Dataset-HYouTube.
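The construction described in the abstract (adjusting the foreground of a real video to obtain a synthetic composite) can be illustrated with a minimal sketch. This is not the authors' pipeline: the function name `make_synthetic_composite`, the per-channel `gain` and `offset` parameters, and the toy data are all hypothetical, and a simple linear color change stands in for whatever foreground adjustment the dataset actually uses. The sketch assumes frames are float arrays in [0, 1] and that a binary foreground mask is available for every frame.

```python
import numpy as np


def make_synthetic_composite(frames, masks, gain, offset):
    """Alter foreground colors of a real video to mimic a composite video.

    frames: array of shape (T, H, W, 3), values in [0, 1] (the real video).
    masks:  array of shape (T, H, W), 1 for foreground, 0 for background.
    gain, offset: hypothetical color adjustment applied as gain * x + offset;
                  the same values are used for every frame so the change
                  stays consistent over time.
    """
    frames = np.asarray(frames, dtype=np.float64)
    masks = np.asarray(masks, dtype=np.float64)[..., None]  # (T, H, W, 1)

    # Recolor the whole frame, then keep the recolored pixels only
    # inside the foreground mask; the background stays untouched.
    adjusted = np.clip(frames * gain + offset, 0.0, 1.0)
    return masks * adjusted + (1.0 - masks) * frames


# Toy example: a 2-frame gray video with a 2x2 foreground square.
real_video = np.full((2, 4, 4, 3), 0.5)
fg_masks = np.zeros((2, 4, 4))
fg_masks[:, 1:3, 1:3] = 1.0

composite = make_synthetic_composite(
    real_video, fg_masks, gain=np.array([1.2, 1.0, 0.8]), offset=0.05
)
```

Under this setup the unmodified real video can serve as the harmonized target for the synthetic composite, which is what makes such pairs usable for supervised training. The 100 real composite videos created via copy-and-paste have no such target by construction.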

Code & Models

Repositories

bcmi/video-harmonization-dataset-hyoutube
PyTorch · Official

Taxonomy

Topics: Advanced Vision and Imaging · Video Coding and Compression Technologies · Advanced Image Processing Techniques