CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
Hao Ouyang, Qiuyu Wang, Yuxi Xiao, Qingyan Bai, Juntao Zhang, Kecheng, Zheng, Xiaowei Zhou, Qifeng Chen, Yujun Shen

TL;DR
CoDeF introduces a novel video representation combining a static canonical content field with a temporal deformation field, enabling effective lifting of image algorithms to videos for consistent processing and tracking without additional training.
Contribution
The paper proposes CoDeF, a new content deformation field for videos that allows image algorithms to be applied to videos with improved temporal consistency and object tracking capabilities.
Findings
Supports lifting image algorithms to videos without training
Achieves superior cross-frame consistency in video processing
Enables tracking of non-rigid objects like water and smog
Abstract
We present the content deformation field CoDeF as a new type of video representation, which consists of a canonical content field aggregating the static contents in the entire video and a temporal deformation field recording the transformations from the canonical image (i.e., rendered from the canonical content field) to each individual frame along the time axis. Given a target video, these two fields are jointly optimized to reconstruct it through a carefully tailored rendering pipeline. We advisedly introduce some regularizations into the optimization process, urging the canonical content field to inherit semantics (e.g., the object shape) from the video. With such a design, CoDeF naturally supports lifting image algorithms for video processing, in the sense that one can apply an image algorithm to the canonical image and effortlessly propagate the outcomes to the entire video with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques · Advanced Vision and Imaging
