AnimeColor: Reference-based Animation Colorization with Diffusion Transformers

Yuhong Zhang; Liyao Wang; Han Wang; Danni Wu; Zuzeng Lin; Feng Wang; Li Song

arXiv:2507.20158·cs.CV·July 29, 2025

AnimeColor: Reference-based Animation Colorization with Diffusion Transformers

Yuhong Zhang, Liyao Wang, Han Wang, Danni Wu, Zuzeng Lin, Feng Wang, Li Song

PDF

TL;DR

AnimeColor introduces a diffusion transformer-based framework for reference-based animation colorization, achieving superior color accuracy, temporal consistency, and visual quality through innovative components and training strategies.

Contribution

It presents a novel diffusion transformer framework with high-level and low-level color extractors for improved animation colorization.

Findings

01

Outperforms existing methods in color accuracy

02

Enhances temporal consistency in animation colorization

03

Achieves high visual quality in generated animations

Abstract

Animation colorization plays a vital role in animation production, yet existing methods struggle to achieve color accuracy and temporal consistency. To address these challenges, we propose \textbf{AnimeColor}, a novel reference-based animation colorization framework leveraging Diffusion Transformers (DiT). Our approach integrates sketch sequences into a DiT-based video diffusion model, enabling sketch-controlled animation generation. We introduce two key components: a High-level Color Extractor (HCE) to capture semantic color information and a Low-level Color Guider (LCG) to extract fine-grained color details from reference images. These components work synergistically to guide the video diffusion process. Additionally, we employ a multi-stage training strategy to maximize the utilization of reference image color information. Extensive experiments demonstrate that AnimeColor outperforms…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.