MEDeA: Multi-view Efficient Depth Adjustment

Mikhail Artemyev; Anna Vorontsova; Anna Sokolova; Alexander Limonov

arXiv:2406.12048·cs.CV·June 19, 2024

MEDeA: Multi-view Efficient Depth Adjustment

Mikhail Artemyev, Anna Vorontsova, Anna Sokolova, Alexander Limonov

PDF

Open Access

TL;DR

MEDeA is a fast, multi-view depth adjustment method that improves depth consistency across frames using only RGB data, outperforming existing approaches on multiple benchmarks.

Contribution

It introduces a novel, efficient multi-view test-time depth adjustment technique that does not require additional data like normals or optical flow.

Findings

01

Sets new state-of-the-art on TUM RGB-D, 7Scenes, and ScanNet.

02

Achieves an order of magnitude faster processing than existing methods.

03

Successfully handles smartphone-captured data from ARKitScenes.

Abstract

The majority of modern single-view depth estimation methods predict relative depth and thus cannot be directly applied in many real-world scenarios, despite impressive performance in the benchmarks. Moreover, single-view approaches cannot guarantee consistency across a sequence of frames. Consistency is typically addressed with test-time optimization of discrepancy across views; however, it takes hours to process a single scene. In this paper, we present MEDeA, an efficient multi-view test-time depth adjustment method, that is an order of magnitude faster than existing test-time approaches. Given RGB frames with camera parameters, MEDeA predicts initial depth maps, adjusts them by optimizing local scaling coefficients, and outputs temporally-consistent depth maps. Contrary to test-time methods requiring normals, optical flow, or semantics estimation, MEDeA produces high-quality…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCCD and CMOS Imaging Sensors

MethodsConvolution · Batch Normalization · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Thinned U-shape Module