MODEST: Multi-Optics Depth-of-Field Stereo Dataset
Nisarg K. Trivedi, Vinayak A. Belludi, Li-Yun Wang

TL;DR
The paper introduces a high-resolution stereo DSLR dataset with extensive optical configurations to improve depth estimation and related tasks under real camera conditions.
Contribution
It provides the first large-scale, high-fidelity stereo DSLR dataset with comprehensive optical variation, enabling better evaluation and training of depth and optical models.
Findings
Enables controlled analysis of optical effects on depth estimation.
Reveals challenges faced by current state-of-the-art methods.
Supports evaluation of classical and learning-based calibration techniques.
Abstract
Reliable depth estimation under real optical conditions remains a core challenge for camera vision in systems such as autonomous robotics and augmented reality. Despite recent progress in depth estimation and depth-of-field rendering, research remains constrained by the lack of large-scale, high-fidelity, real stereo DSLR datasets, limiting real-world generalization and evaluation of models trained on synthetic data as shown extensively in literature. We present the first high-resolution (54723648px) stereo DSLR dataset with 18000 images, systematically varying focal length and aperture across complex real scenes and capturing the optical realism and complexity of professional camera systems. For 9 scenes with varying scene complexity, lighting and background, images are captured with two identical camera assemblies at 10 focal lengths (28-70mm) and 5 apertures (f/2.8-f/22),…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
