MRD: Using Physically Based Differentiable Rendering to Probe Vision Models for 3D Scene Understanding

Benjamin Beilharz; Thomas S. A. Wallis

arXiv:2512.12307·cs.CV·February 24, 2026

MRD: Using Physically Based Differentiable Rendering to Probe Vision Models for 3D Scene Understanding

Benjamin Beilharz, Thomas S. A. Wallis

PDF

Open Access

TL;DR

This paper introduces MRD, a differentiable rendering approach that probes vision models' implicit understanding of 3D scene properties by finding physically different scenes that produce similar model activations.

Contribution

MRD provides a physically grounded method to analyze vision models' sensitivity and invariance to 3D scene attributes, advancing interpretability of deep learning models.

Findings

01

High similarity in model activation between target and optimized scenes.

02

Qualitative reconstructions reveal models' sensitivities to physical scene attributes.

03

MRD enables analysis of how physical scene parameters influence model responses.

Abstract

While deep learning methods have achieved impressive success in many vision benchmarks, it remains difficult to understand and explain the representations and decisions of these models. Though vision models are typically trained on 2D inputs, they are often assumed to develop an implicit representation of the underlying 3D scene (for example, showing tolerance to partial occlusion, or the ability to reason about relative depth). Here, we introduce MRD (metamers rendered differentiably), an approach that uses physically based differentiable rendering to probe vision models' implicit understanding of generative 3D scene properties, by finding 3D scene parameters that are physically different but produce the same model activation (i.e. are model metamers). Unlike previous pixel-based methods for evaluating model representations, these reconstruction results are always grounded in physical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · Computer Graphics and Visualization Techniques · Generative Adversarial Networks and Image Synthesis