MRD: Using Physically Based Differentiable Rendering to Probe Vision Models for 3D Scene Understanding
Benjamin Beilharz, Thomas S. A. Wallis

TL;DR
This paper introduces MRD, a differentiable rendering approach that probes vision models' implicit understanding of 3D scene properties by finding physically different scenes that produce similar model activations.
Contribution
MRD provides a physically grounded method to analyze vision models' sensitivity and invariance to 3D scene attributes, advancing interpretability of deep learning models.
Findings
High similarity in model activation between target and optimized scenes.
Qualitative reconstructions reveal models' sensitivities to physical scene attributes.
MRD enables analysis of how physical scene parameters influence model responses.
Abstract
While deep learning methods have achieved impressive success in many vision benchmarks, it remains difficult to understand and explain the representations and decisions of these models. Though vision models are typically trained on 2D inputs, they are often assumed to develop an implicit representation of the underlying 3D scene (for example, showing tolerance to partial occlusion, or the ability to reason about relative depth). Here, we introduce MRD (metamers rendered differentiably), an approach that uses physically based differentiable rendering to probe vision models' implicit understanding of generative 3D scene properties, by finding 3D scene parameters that are physically different but produce the same model activation (i.e. are model metamers). Unlike previous pixel-based methods for evaluating model representations, these reconstruction results are always grounded in physical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Computer Graphics and Visualization Techniques · Generative Adversarial Networks and Image Synthesis
