MEDeA: Multi-view Efficient Depth Adjustment
Mikhail Artemyev, Anna Vorontsova, Anna Sokolova, Alexander Limonov

TL;DR
MEDeA is a fast, multi-view depth adjustment method that improves depth consistency across frames using only RGB data, outperforming existing approaches on multiple benchmarks.
Contribution
It introduces a novel, efficient multi-view test-time depth adjustment technique that does not require additional data like normals or optical flow.
Findings
Sets new state-of-the-art on TUM RGB-D, 7Scenes, and ScanNet.
Achieves an order of magnitude faster processing than existing methods.
Successfully handles smartphone-captured data from ARKitScenes.
Abstract
The majority of modern single-view depth estimation methods predict relative depth and thus cannot be directly applied in many real-world scenarios, despite impressive performance in the benchmarks. Moreover, single-view approaches cannot guarantee consistency across a sequence of frames. Consistency is typically addressed with test-time optimization of discrepancy across views; however, it takes hours to process a single scene. In this paper, we present MEDeA, an efficient multi-view test-time depth adjustment method, that is an order of magnitude faster than existing test-time approaches. Given RGB frames with camera parameters, MEDeA predicts initial depth maps, adjusts them by optimizing local scaling coefficients, and outputs temporally-consistent depth maps. Contrary to test-time methods requiring normals, optical flow, or semantics estimation, MEDeA produces high-quality…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCCD and CMOS Imaging Sensors
MethodsConvolution · Batch Normalization · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Thinned U-shape Module
