MODEST: Multi-Optics Depth-of-Field Stereo Dataset

Nisarg K. Trivedi; Vinayak A. Belludi; Li-Yun Wang

arXiv:2511.20853·cs.CV·April 21, 2026

MODEST: Multi-Optics Depth-of-Field Stereo Dataset

Nisarg K. Trivedi, Vinayak A. Belludi, Li-Yun Wang

PDF

TL;DR

The paper introduces a high-resolution stereo DSLR dataset with extensive optical configurations to improve depth estimation and related tasks under real camera conditions.

Contribution

It provides the first large-scale, high-fidelity stereo DSLR dataset with comprehensive optical variation, enabling better evaluation and training of depth and optical models.

Findings

01

Enables controlled analysis of optical effects on depth estimation.

02

Reveals challenges faced by current state-of-the-art methods.

03

Supports evaluation of classical and learning-based calibration techniques.

Abstract

Reliable depth estimation under real optical conditions remains a core challenge for camera vision in systems such as autonomous robotics and augmented reality. Despite recent progress in depth estimation and depth-of-field rendering, research remains constrained by the lack of large-scale, high-fidelity, real stereo DSLR datasets, limiting real-world generalization and evaluation of models trained on synthetic data as shown extensively in literature. We present the first high-resolution (5472 $\times$ 3648px) stereo DSLR dataset with 18000 images, systematically varying focal length and aperture across complex real scenes and capturing the optical realism and complexity of professional camera systems. For 9 scenes with varying scene complexity, lighting and background, images are captured with two identical camera assemblies at 10 focal lengths (28-70mm) and 5 apertures (f/2.8-f/22),…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.