CoDeF: Content Deformation Fields for Temporally Consistent Video   Processing

Hao Ouyang; Qiuyu Wang; Yuxi Xiao; Qingyan Bai; Juntao Zhang; Kecheng; Zheng; Xiaowei Zhou; Qifeng Chen; Yujun Shen

arXiv:2308.07926·cs.CV·December 13, 2024·1 cites

CoDeF: Content Deformation Fields for Temporally Consistent Video Processing

Hao Ouyang, Qiuyu Wang, Yuxi Xiao, Qingyan Bai, Juntao Zhang, Kecheng, Zheng, Xiaowei Zhou, Qifeng Chen, Yujun Shen

PDF

Open Access 1 Repo

TL;DR

CoDeF introduces a novel video representation combining a static canonical content field with a temporal deformation field, enabling effective lifting of image algorithms to videos for consistent processing and tracking without additional training.

Contribution

The paper proposes CoDeF, a new content deformation field for videos that allows image algorithms to be applied to videos with improved temporal consistency and object tracking capabilities.

Findings

01

Supports lifting image algorithms to videos without training

02

Achieves superior cross-frame consistency in video processing

03

Enables tracking of non-rigid objects like water and smog

Abstract

We present the content deformation field CoDeF as a new type of video representation, which consists of a canonical content field aggregating the static contents in the entire video and a temporal deformation field recording the transformations from the canonical image (i.e., rendered from the canonical content field) to each individual frame along the time axis. Given a target video, these two fields are jointly optimized to reconstruct it through a carefully tailored rendering pipeline. We advisedly introduce some regularizations into the optimization process, urging the canonical content field to inherit semantics (e.g., the object shape) from the video. With such a design, CoDeF naturally supports lifting image algorithms for video processing, in the sense that one can apply an image algorithm to the canonical image and effortlessly propagate the outcomes to the entire video with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

qiuyu96/codef
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques · Advanced Vision and Imaging