DCT-Net: Domain-Calibrated Translation for Portrait Stylization
Yifang Men, Yuan Yao, Miaomiao Cui, Zhouhui Lian, Xuansong Xie

TL;DR
DCT-Net is a new image translation architecture designed for few-shot portrait stylization, capable of producing high-quality, generalizable stylized images even with limited style exemplars, by calibrating content and leveraging local-global structure.
Contribution
The paper introduces DCT-Net, a novel architecture that improves few-shot portrait stylization through content calibration, geometric expansion, and texture translation modules, enabling high-fidelity and versatile stylization.
Findings
Outperforms state-of-the-art head stylization methods.
Effective in full-body image translation with adaptive deformations.
Handles complex scenes with occlusions and accessories.
Abstract
This paper introduces DCT-Net, a novel image translation architecture for few-shot portrait stylization. Given limited style exemplars (100), the new architecture can produce high-quality style transfer results with advanced ability to synthesize high-fidelity contents and strong generality to handle complicated scenes (e.g., occlusions and accessories). Moreover, it enables full-body image translation via one elegant evaluation network trained by partial observations (i.e., stylized heads). Few-shot learning based style transfer is challenging since the learned model can easily become overfitted in the target domain, due to the biased distribution formed by only a few training examples. This paper aims to handle the challenge by adopting the key idea of "calibration first, translation later" and exploring the augmented global structure with locally-focused translation.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Human Motion and Animation · Face recognition and analysis
MethodsAdapter
