TANGO: Text-driven Photorealistic and Robust 3D Stylization via Lighting   Decomposition

Yongwei Chen; Rui Chen; Jiabao Lei; Yabin Zhang; Kui Jia

arXiv:2210.11277·cs.CV·November 4, 2022·33 cites

TANGO: Text-driven Photorealistic and Robust 3D Stylization via Lighting Decomposition

Yongwei Chen, Rui Chen, Jiabao Lei, Yabin Zhang, Kui Jia

PDF

Open Access 1 Repo 1 Video

TL;DR

TANGO is a novel method that enables photorealistic, text-driven 3D style transfer by disentangling appearance, geometry, and lighting, working effectively on arbitrary meshes without task-specific training.

Contribution

It introduces a disentangled approach using CLIP supervision and a differentiable renderer to achieve robust, photorealistic 3D stylization from text prompts without specialized datasets.

Findings

01

Outperforms existing methods in photorealism and consistency

02

Works effectively on low-quality meshes

03

Does not require task-specific training data

Abstract

Creation of 3D content by stylization is a promising yet challenging problem in computer vision and graphics research. In this work, we focus on stylizing photorealistic appearance renderings of a given surface mesh of arbitrary topology. Motivated by the recent surge of cross-modal supervision of the Contrastive Language-Image Pre-training (CLIP) model, we propose TANGO, which transfers the appearance style of a given 3D shape according to a text prompt in a photorealistic manner. Technically, we propose to disentangle the appearance style as the spatially varying bidirectional reflectance distribution function, the local geometric variation, and the lighting condition, which are jointly optimized, via supervision of the CLIP loss, by a spherical Gaussians based differentiable renderer. As such, TANGO enables photorealistic 3D style transfer by automatically predicting reflectance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Gorilla-Lab-SCUT/tango
pytorchOfficial

Videos

TANGO: Text-driven Photorealistic and Robust 3D Stylization via Lighting Decomposition· slideslive

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · 3D Shape Modeling and Analysis · Computer Graphics and Visualization Techniques

MethodsContrastive Language-Image Pre-training