Learning Blind Video Temporal Consistency

Wei-Sheng Lai; Jia-Bin Huang; Oliver Wang; Eli Shechtman; Ersin Yumer,; Ming-Hsuan Yang

arXiv:1808.00449·cs.CV·August 2, 2018·1 cites

Learning Blind Video Temporal Consistency

Wei-Sheng Lai, Jia-Bin Huang, Oliver Wang, Eli Shechtman, Ersin Yumer,, Ming-Hsuan Yang

PDF

Open Access 1 Repo

TL;DR

This paper introduces a deep recurrent network that enforces temporal consistency across video frames, working with various image processing tasks without task-specific domain knowledge, and operates in real-time.

Contribution

The proposed end-to-end deep recurrent network achieves real-time, task-agnostic temporal consistency in videos without optical flow, handling multiple unseen tasks effectively.

Findings

01

Outperforms state-of-the-art methods on various video tasks

02

Operates in real-time even on high-resolution videos

03

Handles multiple and unseen video processing tasks

Abstract

Applying image processing algorithms independently to each frame of a video often leads to undesired inconsistent results over time. Developing temporally consistent video-based extensions, however, requires domain knowledge for individual tasks and is unable to generalize to other applications. In this paper, we present an efficient end-to-end approach based on deep recurrent network for enforcing temporal consistency in a video. Our method takes the original unprocessed and per-frame processed videos as inputs to produce a temporally consistent video. Consequently, our approach is agnostic to specific image processing algorithms applied on the original video. We train the proposed network by minimizing both short-term and long-term temporal losses as well as the perceptual loss to strike a balance between temporal stability and perceptual similarity with the processed frames. At test…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

phoenix104104/fast_blind_video_consistency
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques · Advanced Vision and Imaging · Image Enhancement Techniques

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings