SSIMBaD: Sigma Scaling with SSIM-Guided Balanced Diffusion for AnimeFace Colorization
Junpyo Seo, Hanbin Koo, Jieun Yook, and Byung-Ro Moon (Department of Computer Science, Seoul National University)

TL;DR
This paper introduces SSIMBaD, a diffusion-based framework that improves anime face colorization by maintaining structural fidelity and stylistic transfer, using SSIM-guided sigma scaling for balanced and perceptually consistent reconstructions.
Contribution
The paper presents a novel SSIM-guided sigma scaling method within diffusion models to enhance anime face colorization, ensuring structural preservation and style transfer with improved perceptual quality.
Findings
Outperforms state-of-the-art models in pixel accuracy
Achieves higher perceptual quality in anime face colorization
Demonstrates effective generalization across diverse styles
Abstract
We propose a novel diffusion-based framework for automatic colorization of Anime-style facial sketches. Our method preserves the structural fidelity of the input sketch while effectively transferring stylistic attributes from a reference image. Unlike traditional approaches that rely on predefined noise schedules - which often compromise perceptual consistency -- our framework builds on continuous-time diffusion models and introduces SSIMBaD (Sigma Scaling with SSIM-Guided Balanced Diffusion). SSIMBaD applies a sigma-space transformation that aligns perceptual degradation, as measured by structural similarity (SSIM), in a linear manner. This scaling ensures uniform visual difficulty across timesteps, enabling more balanced and faithful reconstructions. Experiments on a large-scale Anime face dataset demonstrate that our method outperforms state-of-the-art models in both pixel accuracy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Face recognition and analysis · Evolutionary Psychology and Human Behavior
MethodsDiffusion · Colorization
