DCT-Net: Domain-Calibrated Translation for Portrait Stylization

Yifang Men; Yuan Yao; Miaomiao Cui; Zhouhui Lian; Xuansong Xie

arXiv:2207.02426·cs.CV·July 7, 2022

DCT-Net: Domain-Calibrated Translation for Portrait Stylization

Yifang Men, Yuan Yao, Miaomiao Cui, Zhouhui Lian, Xuansong Xie

PDF

Open Access 3 Repos

TL;DR

DCT-Net is a new image translation architecture designed for few-shot portrait stylization, capable of producing high-quality, generalizable stylized images even with limited style exemplars, by calibrating content and leveraging local-global structure.

Contribution

The paper introduces DCT-Net, a novel architecture that improves few-shot portrait stylization through content calibration, geometric expansion, and texture translation modules, enabling high-fidelity and versatile stylization.

Findings

01

Outperforms state-of-the-art head stylization methods.

02

Effective in full-body image translation with adaptive deformations.

03

Handles complex scenes with occlusions and accessories.

Abstract

This paper introduces DCT-Net, a novel image translation architecture for few-shot portrait stylization. Given limited style exemplars ( $\sim$ 100), the new architecture can produce high-quality style transfer results with advanced ability to synthesize high-fidelity contents and strong generality to handle complicated scenes (e.g., occlusions and accessories). Moreover, it enables full-body image translation via one elegant evaluation network trained by partial observations (i.e., stylized heads). Few-shot learning based style transfer is challenging since the learned model can easily become overfitted in the target domain, due to the biased distribution formed by only a few training examples. This paper aims to handle the challenge by adopting the key idea of "calibration first, translation later" and exploring the augmented global structure with locally-focused translation.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Human Motion and Animation · Face recognition and analysis

MethodsAdapter