Physically Disentangled Representations

Tzofi Klinghoffer; Kushagra Tiwary; Arkadiusz Balata; Vivek Sharma,; Ramesh Raskar

arXiv:2204.05281·cs.CV·April 12, 2022·1 cites

Physically Disentangled Representations

Tzofi Klinghoffer, Kushagra Tiwary, Arkadiusz Balata, Vivek Sharma,, Ramesh Raskar

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel inverse rendering-based method for learning physically disentangled scene representations, improving downstream task performance and robustness to scene variations without supervision.

Contribution

It proposes a new inverse rendering approach with a Leave-One-Out, Cycle Contrastive loss to enhance physical disentanglement in representations.

Findings

01

Higher accuracy in downstream tasks by up to 18%

02

Improved robustness to lighting and viewpoint variations

03

Outperforms semantically disentangled methods across multiple tasks

Abstract

State-of-the-art methods in generative representation learning yield semantic disentanglement, but typically do not consider physical scene parameters, such as geometry, albedo, lighting, or camera. We posit that inverse rendering, a way to reverse the rendering process to recover scene parameters from an image, can also be used to learn physically disentangled representations of scenes without supervision. In this paper, we show the utility of inverse rendering in learning representations that yield improved accuracy on downstream clustering, linear classification, and segmentation tasks with the help of our novel Leave-One-Out, Cycle Contrastive loss (LOOCC), which improves disentanglement of scene parameters and robustness to out-of-distribution lighting and viewpoints. We perform a comparison of our method with other generative representation learning methods across a variety of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tzofi/physically-disentangled-representations
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Face recognition and analysis · Digital Media Forensic Detection