CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoors Object Detection from Multi-view Images
Guanlin Shen, Jingwei Huang, Zhihua Hu, Bin Wang

TL;DR
CN-RMA introduces a novel multi-view 3D indoor object detection method that combines 3D reconstruction and detection networks, using ray marching weights to improve accuracy and occlusion handling, achieving state-of-the-art results.
Contribution
The paper presents a new approach that integrates ray marching with 3D reconstruction and detection networks for enhanced multi-view indoor object detection.
Findings
Achieves state-of-the-art mAP on ScanNet and ARKitScenes datasets.
Effectively handles occlusion through ray marching weights.
End-to-end training improves 3D localization accuracy.
Abstract
This paper introduces CN-RMA, a novel approach for 3D indoor object detection from multi-view images. We observe the key challenge as the ambiguity of image and 3D correspondence without explicit geometry to provide occlusion information. To address this issue, CN-RMA leverages the synergy of 3D reconstruction networks and 3D object detection networks, where the reconstruction network provides a rough Truncated Signed Distance Function (TSDF) and guides image features to vote to 3D space correctly in an end-to-end manner. Specifically, we associate weights to sampled points of each ray through ray marching, representing the contribution of a pixel in an image to corresponding 3D locations. Such weights are determined by the predicted signed distances so that image features vote only to regions near the reconstructed surface. Our method achieves state-of-the-art performance in 3D object…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Advanced Neural Network Applications · Image Enhancement Techniques
