Multi-Style Facial Sketch Synthesis through Masked Generative Modeling
Bowen Sun, Guo Lu, Shibao Zheng

TL;DR
This paper introduces a lightweight, end-to-end multi-style facial sketch synthesis model that leverages semi-supervised learning and masked generative transformers to produce high-quality, stylized sketches from facial photos, overcoming data and style limitations.
Contribution
The study presents a novel multi-style facial sketch synthesis approach using masked generative modeling and semi-supervised learning, improving quality and diversity without extra inputs.
Findings
Outperforms previous methods on multiple benchmarks.
Produces continuous stylized sketches with accurate facial features.
Effectively handles data scarcity and style diversity challenges.
Abstract
The facial sketch synthesis (FSS) model, capable of generating sketch portraits from given facial photographs, holds profound implications across multiple domains, encompassing cross-modal face recognition, entertainment, art, media, among others. However, the production of high-quality sketches remains a formidable task, primarily due to the challenges and flaws associated with three key factors: (1) the scarcity of artist-drawn data, (2) the constraints imposed by limited style types, and (3) the deficiencies of processing input information in existing models. To address these difficulties, we propose a lightweight end-to-end synthesis model that efficiently converts images to corresponding multi-stylized sketches, obviating the necessity for any supplementary inputs (\eg, 3D geometry). In this study, we overcome the issue of data insufficiency by incorporating semi-supervised…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Generative Adversarial Networks and Image Synthesis · Face Recognition and Perception
