DeFormer: Integrating Transformers with Deformable Models for 3D Shape Abstraction from a Single Image

Di Liu; Xiang Yu; Meng Ye; Qilong Zhangli; Zhuowei Li; Zhixing Zhang; Dimitris N. Metaxas

arXiv:2309.12594·cs.CV·October 5, 2023

TL;DR

DeFormer is a novel Transformer-based approach that uses deformable models to accurately abstract complex 3D shapes from a single image with fewer primitives, enhancing detail and interpretability.

Contribution

The paper introduces a bi-channel Transformer architecture, combined with parameterized deformable primitives, for efficient 3D shape abstraction from a single image.

Findings

01 Outperforms state-of-the-art in shape reconstruction accuracy

02 Uses fewer primitives for broader geometric coverage

03 Provides consistent semantic correspondences for interpretability

Abstract

Accurate 3D shape abstraction from a single 2D image is a long-standing problem in computer vision and graphics. By leveraging a set of primitives to represent the target shape, recent methods have achieved promising results. However, these methods either use a relatively large number of primitives or lack geometric flexibility due to the limited expressibility of the primitives. In this paper, we propose a novel bi-channel Transformer architecture, integrated with parameterized deformable models, termed DeFormer, to simultaneously estimate the global and local deformations of primitives. In this way, DeFormer can abstract complex object shapes while using a small number of primitives which offer a broader geometry coverage and finer details. Then, we introduce a force-driven dynamic fitting and a cycle-consistent re-projection loss to optimize the primitive parameters. Extensive…
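
To make the idea of a parameterized primitive with global and local deformations more concrete, here is a minimal sketch. It assumes a superquadric base shape with linear tapering as the global deformation and per-point offsets as the local deformation. The abstract names neither of these choices, so treat them as illustrative assumptions. The function names (`superquadric_points`, `taper`, `deform`) are hypothetical and do not come from the paper or any library.

```python
import numpy as np


def _spow(x, p):
    # Signed power sign(x) * |x|**p, which keeps the surface symmetric.
    return np.sign(x) * np.abs(x) ** p


def superquadric_points(scale, eps, n=16):
    """Sample an n*n grid of points on a superquadric surface.

    scale = (a1, a2, a3) are the axis lengths, eps = (e1, e2) the
    squareness exponents (assumed parameterization, not from the paper).
    """
    a1, a2, a3 = scale
    e1, e2 = eps
    eta = np.linspace(-np.pi / 2, np.pi / 2, n)   # latitude
    omega = np.linspace(-np.pi, np.pi, n)         # longitude
    eta, omega = np.meshgrid(eta, omega, indexing="ij")
    x = a1 * _spow(np.cos(eta), e1) * _spow(np.cos(omega), e2)
    y = a2 * _spow(np.cos(eta), e1) * _spow(np.sin(omega), e2)
    z = a3 * _spow(np.sin(eta), e1)
    return np.stack([x, y, z], axis=-1).reshape(-1, 3)


def taper(points, kx, ky, a3):
    """Global deformation: scale x and y linearly as a function of z."""
    out = points.copy()
    f = points[:, 2] / a3          # normalized height in [-1, 1]
    out[:, 0] *= kx * f + 1.0
    out[:, 1] *= ky * f + 1.0
    return out


def deform(scale, eps, kx, ky, local_offsets=None, n=16):
    """Apply the global taper, then optional per-point local offsets."""
    pts = taper(superquadric_points(scale, eps, n), kx, ky, scale[2])
    if local_offsets is not None:
        pts = pts + local_offsets  # local deformation, shape (n*n, 3)
    return pts
```

In this sketch, a network would predict the handful of global parameters (`scale`, `eps`, `kx`, `ky`) and the dense `local_offsets` for each primitive. A few global parameters capture the coarse shape, while the offsets add finer detail.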

Taxonomy

Topics: 3D Shape Modeling and Analysis · Advanced Vision and Imaging · Medical Image Segmentation Techniques

Methods: Multi-Head Attention · Attention Is All You Need · Linear Layer · Adam · Residual Connection · Layer Normalization · Label Smoothing · Byte Pair Encoding · Dropout · Softmax