TANGO: Text-driven Photorealistic and Robust 3D Stylization via Lighting Decomposition
Yongwei Chen, Rui Chen, Jiabao Lei, Yabin Zhang, Kui Jia

TL;DR
TANGO is a novel method that enables photorealistic, text-driven 3D style transfer by disentangling appearance, geometry, and lighting, working effectively on arbitrary meshes without task-specific training.
Contribution
It introduces a disentangled approach using CLIP supervision and a differentiable renderer to achieve robust, photorealistic 3D stylization from text prompts without specialized datasets.
Findings
Outperforms existing methods in photorealism and consistency
Works effectively on low-quality meshes
Does not require task-specific training data
Abstract
Creation of 3D content by stylization is a promising yet challenging problem in computer vision and graphics research. In this work, we focus on stylizing photorealistic appearance renderings of a given surface mesh of arbitrary topology. Motivated by the recent surge of cross-modal supervision of the Contrastive Language-Image Pre-training (CLIP) model, we propose TANGO, which transfers the appearance style of a given 3D shape according to a text prompt in a photorealistic manner. Technically, we propose to disentangle the appearance style as the spatially varying bidirectional reflectance distribution function, the local geometric variation, and the lighting condition, which are jointly optimized, via supervision of the CLIP loss, by a spherical Gaussians based differentiable renderer. As such, TANGO enables photorealistic 3D style transfer by automatically predicting reflectance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · 3D Shape Modeling and Analysis · Computer Graphics and Visualization Techniques
MethodsContrastive Language-Image Pre-training
