A New Multi-Picture Architecture for Learned Video Deinterlacing and   Demosaicing with Parallel Deformable Convolution and Self-Attention Blocks

Ronglei Ji; A. Murat Tekalp

arXiv:2404.13018·eess.IV·April 22, 2024·Image Vis. Comput.

A New Multi-Picture Architecture for Learned Video Deinterlacing and Demosaicing with Parallel Deformable Convolution and Self-Attention Blocks

Ronglei Ji, A. Murat Tekalp

PDF

1 Repo

TL;DR

This paper introduces a novel multi-picture architecture utilizing deformable convolution and self-attention for improved learned video deinterlacing and demosaicing, outperforming existing methods in quality metrics.

Contribution

The paper presents a new multi-picture architecture with modified deformable convolution and a residual efficient top-k self-attention block for better video deinterlacing and demosaicing.

Findings

01

Significantly exceeds state-of-the-art in PSNR and SSIM.

02

Effective in both synthetic and real-world datasets.

03

Ablation studies validate each component's benefit.

Abstract

Despite the fact real-world video deinterlacing and demosaicing are well-suited to supervised learning from synthetically degraded data because the degradation models are known and fixed, learned video deinterlacing and demosaicing have received much less attention compared to denoising and super-resolution tasks. We propose a new multi-picture architecture for video deinterlacing or demosaicing by aligning multiple supporting pictures with missing data to a reference picture to be reconstructed, benefiting from both local and global spatio-temporal correlations in the feature space using modified deformable convolution blocks and a novel residual efficient top- $k$ self-attention (kSA) block, respectively. Separate reconstruction blocks are used to estimate different types of missing data. Our extensive experimental results, on synthetic or real-world datasets, demonstrate that the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kuis-ai-tekalp-research-group/video-deinterlacing
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSelf-Attention Guidance · Convolution · Deformable Convolution