IDArb: Intrinsic Decomposition for Arbitrary Number of Input Views and   Illuminations

Zhibing Li; Tong Wu; Jing Tan; Mengchen Zhang; Jiaqi Wang; Dahua Lin

arXiv:2412.12083·cs.CV·April 2, 2025

IDArb: Intrinsic Decomposition for Arbitrary Number of Input Views and Illuminations

Zhibing Li, Tong Wu, Jing Tan, Mengchen Zhang, Jiaqi Wang, Dahua Lin

PDF

Open Access 1 Video

TL;DR

IDArb is a diffusion-based model that performs intrinsic decomposition on multiple views and illuminations, accurately estimating surface normals and material properties while maintaining multi-view consistency.

Contribution

We introduce IDArb, a novel diffusion model with a cross-view attention module and view-adaptive training for multi-view intrinsic decomposition under varying lighting.

Findings

01

Outperforms state-of-the-art methods in accuracy and consistency.

02

Supports diverse downstream tasks like relighting and 3D reconstruction.

03

Introduces ARB-Objaverse dataset for training and evaluation.

Abstract

Capturing geometric and material information from images remains a fundamental challenge in computer vision and graphics. Traditional optimization-based methods often require hours of computational time to reconstruct geometry, material properties, and environmental lighting from dense multi-view inputs, while still struggling with inherent ambiguities between lighting and material. On the other hand, learning-based approaches leverage rich material priors from existing 3D object datasets but face challenges with maintaining multi-view consistency. In this paper, we introduce IDArb, a diffusion-based model designed to perform intrinsic decomposition on an arbitrary number of images under varying illuminations. Our method achieves accurate and multi-view consistent estimation on surface normals and material properties. This is made possible through a novel cross-view, cross-domain…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

IDArb: Intrinsic Decomposition for Arbitrary Number of Input Views and Illuminations· slideslive

Taxonomy

TopicsComputer Graphics and Visualization Techniques

MethodsSoftmax · Attention Is All You Need