Rethink Arbitrary Style Transfer with Transformer and Contrastive Learning
Zhanjie Zhang, Jiakai Sun, Guangyuan Li, Lei Zhao, Quanwei Zhang,, Zehua Lan, Haolin Yin, Wei Xing, Huaizhong Lin, Zhiwen Zuo

TL;DR
This paper introduces a novel style transfer method using a perception encoder, style consistency normalization, and contrastive learning to produce higher quality stylized images with fewer artifacts.
Contribution
It proposes a new framework combining SCIN, ICL, and PE to improve style transfer quality, addressing limitations of previous cross-attention and adaptive normalization methods.
Findings
Produces higher quality stylized images with fewer artifacts
Outperforms existing state-of-the-art methods in experiments
Enhances style-feature understanding through contrastive learning
Abstract
Arbitrary style transfer holds widespread attention in research and boasts numerous practical applications. The existing methods, which either employ cross-attention to incorporate deep style attributes into content attributes or use adaptive normalization to adjust content features, fail to generate high-quality stylized images. In this paper, we introduce an innovative technique to improve the quality of stylized images. Firstly, we propose Style Consistency Instance Normalization (SCIN), a method to refine the alignment between content and style features. In addition, we have developed an Instance-based Contrastive Learning (ICL) approach designed to understand the relationships among various styles, thereby enhancing the quality of the resulting stylized images. Recognizing that VGG networks are more adept at extracting classification features and need to be better suited for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Text and Document Classification Technologies · Speech Recognition and Synthesis
MethodsDense Connections · Max Pooling · Softmax · Convolution · Dropout · Instance Normalization · Contrastive Learning
