SemanticPaint: A Framework for the Interactive Segmentation of 3D Scenes

Stuart Golodetz; Michael Sapienza; Julien P. C. Valentin; Vibhav; Vineet; Ming-Ming Cheng; Anurag Arnab; Victor A. Prisacariu; Olaf K\"ahler,; Carl Yuheng Ren; David W. Murray; Shahram Izadi; Philip H. S. Torr

arXiv:1510.03727·cs.CV·September 5, 2017·42 cites

SemanticPaint: A Framework for the Interactive Segmentation of 3D Scenes

Stuart Golodetz, Michael Sapienza, Julien P. C. Valentin, Vibhav, Vineet, Ming-Ming Cheng, Anurag Arnab, Victor A. Prisacariu, Olaf K\"ahler,, Carl Yuheng Ren, David W. Murray, Shahram Izadi, Philip H. S. Torr

PDF

Open Access

TL;DR

SemanticPaint is a real-time system that enables interactive segmentation and labeling of 3D scenes using user input, machine learning, and geometric reconstruction, facilitating efficient scene understanding and annotation.

Contribution

It introduces an open-source, real-time framework combining interactive segmentation, user input, and online learning for 3D scene annotation.

Findings

01

Real-time 3D scene segmentation and labeling

02

User interaction improves segmentation accuracy

03

System operates efficiently in live environments

Abstract

We present an open-source, real-time implementation of SemanticPaint, a system for geometric reconstruction, object-class segmentation and learning of 3D scenes. Using our system, a user can walk into a room wearing a depth camera and a virtual reality headset, and both densely reconstruct the 3D scene and interactively segment the environment into object classes such as 'chair', 'floor' and 'table'. The user interacts physically with the real-world scene, touching objects and using voice commands to assign them appropriate labels. These user-generated labels are leveraged by an online random forest-based machine learning algorithm, which is used to predict labels for previously unseen parts of the scene. The entire pipeline runs in real time, and the user stays 'in the loop' throughout the process, receiving immediate feedback about the progress of the labelling and interacting with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Advanced Image and Video Retrieval Techniques · Robotics and Sensor-Based Localization