Ghost-free High Dynamic Range Imaging with Context-aware Transformer

Zhen Liu; Yinglong Wang; Bing Zeng; Shuaicheng Liu

arXiv:2208.05114 · cs.CV · August 11, 2022 · 1 citation

Open Access 3 Repos

TL;DR

This paper introduces a dual-branch Vision Transformer architecture for ghost-free HDR imaging. It captures both global and local dependencies to reduce the ghosting artifacts and intensity distortions caused by large motion and severe saturation.

Contribution

The paper proposes CA-ViT, a dual-branch architecture that combines global and local context modeling for HDR deghosting. The authors report that it outperforms existing CNN-based methods.

Findings

01. Outperforms state-of-the-art methods quantitatively and qualitatively
02. Reduces computational costs significantly
03. Effectively handles large motion and saturation artifacts

Abstract

High dynamic range (HDR) deghosting algorithms aim to generate ghost-free HDR images with realistic details. Restricted by the locality of the receptive field, existing CNN-based methods are typically prone to producing ghosting artifacts and intensity distortions in the presence of large motion and severe saturation. In this paper, we propose a novel Context-Aware Vision Transformer (CA-ViT) for ghost-free high dynamic range imaging. The CA-ViT is designed as a dual-branch architecture, which can jointly capture both global and local dependencies. Specifically, the global branch employs a window-based Transformer encoder to model long-range object movements and intensity variations to solve ghosting. For the local branch, we design a local context extractor (LCE) to capture short-range image features and use the channel attention mechanism to select informative local details across the…
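The dual-branch idea described in the abstract can be illustrated with a rough NumPy sketch. This is not the authors' implementation: the function names, the single-head attention without learned projections, the squeeze-and-excitation-style gate, and the additive fusion of the two branches are all assumptions made for illustration. The convolutional part of the local context extractor is omitted, so the local branch here is reduced to channel attention only.

```python
import numpy as np

def window_self_attention(x, window=4):
    """Global-branch stand-in (assumption): single-head self-attention
    computed inside non-overlapping windows, with no learned projections.
    x has shape (H, W, C); H and W must be divisible by `window`."""
    H, W, C = x.shape
    # Partition the map into (num_windows, window*window, C) token groups.
    t = x.reshape(H // window, window, W // window, window, C)
    t = t.transpose(0, 2, 1, 3, 4).reshape(-1, window * window, C)
    # Scaled dot-product attention within each window.
    scores = t @ t.transpose(0, 2, 1) / np.sqrt(C)
    scores -= scores.max(axis=-1, keepdims=True)  # numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)
    out = weights @ t
    # Reverse the window partition back to (H, W, C).
    out = out.reshape(H // window, W // window, window, window, C)
    return out.transpose(0, 2, 1, 3, 4).reshape(H, W, C)

def channel_attention(x, w1, w2):
    """Local-branch stand-in (assumption): squeeze-and-excitation-style gate.
    x: (H, W, C); w1: (C, C_hidden); w2: (C_hidden, C)."""
    pooled = x.mean(axis=(0, 1))                  # global average pool -> (C,)
    hidden = np.maximum(pooled @ w1, 0.0)         # ReLU
    gate = 1.0 / (1.0 + np.exp(-(hidden @ w2)))   # sigmoid, one weight per channel
    return x * gate                               # rescale each channel

def ca_block_sketch(x, w1, w2, window=4):
    """Hypothetical dual-branch block: the two branch outputs are summed
    (the fusion rule is an assumption, not taken from the abstract)."""
    return window_self_attention(x, window) + channel_attention(x, w1, w2)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    feats = rng.standard_normal((8, 8, 4))        # toy feature map
    w1 = rng.standard_normal((4, 2))
    w2 = rng.standard_normal((2, 4))
    print(ca_block_sketch(feats, w1, w2).shape)
```

The window partition keeps attention cost bounded by the window size, while the channel gate depends only on per-channel averages, which is one plausible reading of how the two branches might complement each other.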

Taxonomy

Topics: Image Enhancement Techniques · Advanced Neural Network Applications · Image and Signal Denoising Methods

Methods: Attention Is All You Need · Linear Layer · Absolute Position Encodings · Label Smoothing · Softmax · Adam · Position-Wise Feed-Forward Layer · Layer Normalization · Byte Pair Encoding · Residual Connection