360-MLC: Multi-view Layout Consistency for Self-training and   Hyper-parameter Tuning

Bolivar Solarte; Chin-Hsuan Wu; Yueh-Cheng Liu; Yi-Hsuan Tsai; Min Sun

arXiv:2210.12935·cs.CV·October 25, 2022·1 cites

360-MLC: Multi-view Layout Consistency for Self-training and Hyper-parameter Tuning

Bolivar Solarte, Chin-Hsuan Wu, Yueh-Cheng Liu, Yi-Hsuan Tsai, Min Sun

PDF

Open Access 1 Repo 1 Video

TL;DR

360-MLC introduces a self-training approach leveraging multi-view layout consistency in 360-images to adapt monocular room-layout models to new domains without ground truth annotations.

Contribution

It proposes a novel multi-view consistency-based self-training method for monocular layout estimation that also enables hyper-parameter tuning without labeled data.

Findings

01

Achieves competitive performance compared to state-of-the-art methods.

02

Effectively uses uncertainty and entropy for pseudo-label weighting and model evaluation.

03

Demonstrates successful domain adaptation on a new multi-view dataset.

Abstract

We present 360-MLC, a self-training method based on multi-view layout consistency for finetuning monocular room-layout models using unlabeled 360-images only. This can be valuable in practical scenarios where a pre-trained model needs to be adapted to a new data domain without using any ground truth annotations. Our simple yet effective assumption is that multiple layout estimations in the same scene must define a consistent geometry regardless of their camera positions. Based on this idea, we leverage a pre-trained model to project estimated layout boundaries from several camera views into the 3D world coordinate. Then, we re-project them back to the spherical coordinate and build a probability function, from which we sample the pseudo-labels for self-training. To handle unconfident pseudo-labels, we evaluate the variance in the re-projected boundaries as an uncertainty value to weight…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

EnriqueSolarte/360-mlc
pytorchOfficial

Videos

360-MLC: Multi-view Layout Consistency for Self-training and Hyper-parameter Tuning· slideslive

Taxonomy

TopicsAdvanced Vision and Imaging · Robotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques