Multimodal Semantic-Aware Automatic Colorization with Diffusion Prior

Han Wang; Xinning Chai; Yiwen Wang; Yuhong Zhang; Rong Xie; Li Song

arXiv:2404.16678·cs.CV·April 26, 2024

Multimodal Semantic-Aware Automatic Colorization with Diffusion Prior

Han Wang, Xinning Chai, Yiwen Wang, Yuhong Zhang, Rong Xie, Li Song

PDF

Open Access

TL;DR

This paper introduces a novel multimodal diffusion-based pipeline for automatic image colorization that enhances color saturation, semantic accuracy, and visual quality, outperforming previous methods in realism and user preference.

Contribution

It proposes a diffusion prior with luminance guidance and multimodal semantic priors to improve colorization quality and realism.

Findings

01

Outperforms previous methods in perceptual realism.

02

Generates saturated, semantically plausible colors.

03

Surpasses prior approaches in human preference studies.

Abstract

Colorizing grayscale images offers an engaging visual experience. Existing automatic colorization methods often fail to generate satisfactory results due to incorrect semantic colors and unsaturated colors. In this work, we propose an automatic colorization pipeline to overcome these challenges. We leverage the extraordinary generative ability of the diffusion prior to synthesize color with plausible semantics. To overcome the artifacts introduced by the diffusion prior, we apply the luminance conditional guidance. Moreover, we adopt multimodal high-level semantic priors to help the model understand the image content and deliver saturated colors. Besides, a luminance-aware decoder is designed to restore details and enhance overall visual quality. The proposed pipeline synthesizes saturated colors while maintaining plausible semantics. Experiments indicate that our proposed method…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsColor perception and design

MethodsColorization · Diffusion