FantasyStyle: Controllable Stylized Distillation for 3D Gaussian Splatting
Yitong Yang, Yinglin Wang, Changshuo Wang, Huajie Wang, Shuting He

TL;DR
FantasyStyle introduces a novel 3D Gaussian Splatting style transfer method that enhances multi-view consistency and controllability by leveraging diffusion model distillation and negative guidance, outperforming previous approaches.
Contribution
It is the first to rely entirely on diffusion model distillation for 3D style transfer, addressing multi-view inconsistency and content leakage issues with novel frequency and guidance techniques.
Findings
Outperforms state-of-the-art methods in stylization quality
Achieves higher visual realism across various scenes and styles
Effectively suppresses content leakage and style conflicts
Abstract
The success of 3DGS in generative and editing applications has sparked growing interest in 3DGS-based style transfer. However, current methods still face two major challenges: (1) multi-view inconsistency often leads to style conflicts, resulting in appearance smoothing and distortion; and (2) heavy reliance on VGG features, which struggle to disentangle style and content from style images, often causing content leakage and excessive stylization. To tackle these issues, we introduce \textbf{FantasyStyle}, a 3DGS-based style transfer framework, and the first to rely entirely on diffusion model distillation. It comprises two key components: (1) \textbf{Multi-View Frequency Consistency}. We enhance cross-view consistency by applying a 3D filter to multi-view noisy latent, selectively reducing low-frequency components to mitigate stylized prior conflicts. (2) \textbf{Controllable Stylized…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsFire Detection and Safety Systems
