EXIM: A Hybrid Explicit-Implicit Representation for Text-Guided 3D Shape Generation
Zhengzhe Liu, Jingyu Hu, Ka-Hei Hui, Xiaojuan Qi, Daniel Cohen-Or,, Chi-Wing Fu

TL;DR
This paper introduces EXIM, a hybrid explicit-implicit 3D shape representation technique guided by text, enabling high-fidelity, style-consistent shape generation from natural language without extensive optimization or human annotations.
Contribution
The paper proposes a novel hybrid explicit-implicit shape representation for text-guided 3D shape generation, improving fidelity and coherence over existing methods.
Findings
Outperforms state-of-the-art methods in shape-text coherence
Generates high-quality 3D shapes from natural language descriptions
Enables style-consistent indoor scene generation
Abstract
This paper presents a new text-guided technique for generating 3D shapes. The technique leverages a hybrid 3D shape representation, namely EXIM, combining the strengths of explicit and implicit representations. Specifically, the explicit stage controls the topology of the generated 3D shapes and enables local modifications, whereas the implicit stage refines the shape and paints it with plausible colors. Also, the hybrid approach separates the shape and color and generates color conditioned on shape to ensure shape-color consistency. Unlike the existing state-of-the-art methods, we achieve high-fidelity shape generation from natural-language descriptions without the need for time-consuming per-shape optimization or reliance on human-annotated texts during training or test-time optimization. Further, we demonstrate the applicability of our approach to generate indoor scenes with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Human Motion and Animation · Image Processing and 3D Reconstruction
