JieHua Paintings Style Feature Extracting Model using Stable Diffusion with ControlNet
Yujia Gu, Haofeng Li, Xinyu Fang, Zihan Peng, Yinan Peng

TL;DR
This paper introduces a novel style feature extraction method for Jiehua paintings using a fine-tuned Stable Diffusion Model with ControlNet, outperforming CycleGAN in style transfer quality and preserving semantic content.
Contribution
The study presents a new approach combining Stable Diffusion and ControlNet for Jiehua style extraction, with optimized hyperparameters showing superior performance over existing models.
Findings
FSDMC achieves an FID of 3.27 on Jiehua dataset.
FSDMC surpasses CycleGAN in expert evaluations.
The method effectively preserves semantic information during style transfer.
Abstract
This study proposes a novel approach to extract stylistic features of Jiehua: the utilization of the Fine-tuned Stable Diffusion Model with ControlNet (FSDMC) to refine depiction techniques from artists' Jiehua. The training data for FSDMC is based on the opensource Jiehua artist's work collected from the Internet, which were subsequently manually constructed in the format of (Original Image, Canny Edge Features, Text Prompt). By employing the optimal hyperparameters identified in this paper, it was observed FSDMC outperforms CycleGAN, another mainstream style transfer model. FSDMC achieves FID of 3.27 on the dataset and also surpasses CycleGAN in terms of expert evaluation. This not only demonstrates the model's high effectiveness in extracting Jiehua's style features, but also preserves the original pre-trained semantic information. The findings of this study suggest that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Media and Visual Art · Industrial Vision Systems and Defect Detection · 3D Shape Modeling and Analysis
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Instance Normalization · Convolution · Batch Normalization · PatchGAN · Sigmoid Activation · Diffusion · Cycle Consistency Loss · GAN Least Squares Loss · Residual Connection
