Playing Lottery Tickets in Style Transfer Models
Meihao Kong, Jing Huo, Wenbin Li, Jing Wu, Yu-Kun Lai, Yang Gao

TL;DR
This paper empirically investigates the existence of sparse, trainable subnetworks within style transfer models using the lottery ticket hypothesis, demonstrating significant sparsity without performance loss across multiple models.
Contribution
First to verify the presence of lottery ticket subnetworks in style transfer models, enabling high sparsity and potential for efficient model compression.
Findings
Matching subnetworks exist at high sparsity levels (89.2% in AdaIN, 73.7% in SANet)
Pruning feature transformation modules maintains model performance
Lottery ticket hypothesis generalizes across various style transfer models
Abstract
Style transfer has achieved great success and attracted a wide range of attention from both academic and industrial communities due to its flexible application scenarios. However, the dependence on a pretty large VGG-based autoencoder leads to existing style transfer models having high parameter complexities, which limits their applications on resource-constrained devices. Compared with many other tasks, the compression of style transfer models has been less explored. Recently, the lottery ticket hypothesis (LTH) has shown great potential in finding extremely sparse matching subnetworks which can achieve on par or even better performance than the original full networks when trained in isolation. In this work, we for the first time perform an empirical study to verify whether such trainable matching subnetworks also exist in style transfer models. Specifically, we take two most popular…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Video Analysis and Summarization · Music and Audio Processing
MethodsSoftmax · Dropout · Dense Connections · Convolution · Max Pooling · Self-Attention Network
