# JSIS3D: Joint Semantic-Instance Segmentation of 3D Point Clouds with   Multi-Task Pointwise Networks and Multi-Value Conditional Random Fields

**Authors:** Quang-Hieu Pham, Duc Thanh Nguyen, Binh-Son Hua, Gemma Roig, Sai-Kit, Yeung

arXiv: 1904.00699 · 2019-04-08

## TL;DR

This paper introduces JSIS3D, a novel approach combining multi-task pointwise networks and multi-value CRFs for joint semantic and instance segmentation of 3D point clouds, achieving state-of-the-art results.

## Contribution

The paper presents a new joint segmentation framework that integrates deep learning with probabilistic graphical models for improved 3D scene understanding.

## Key findings

- Robust joint semantic-instance segmentation on indoor datasets.
- State-of-the-art performance in semantic segmentation.
- Enhanced robustness over individual components.

## Abstract

Deep learning techniques have become the to-go models for most vision-related tasks on 2D images. However, their power has not been fully realised on several tasks in 3D space, e.g., 3D scene understanding. In this work, we jointly address the problems of semantic and instance segmentation of 3D point clouds. Specifically, we develop a multi-task pointwise network that simultaneously performs two tasks: predicting the semantic classes of 3D points and embedding the points into high-dimensional vectors so that points of the same object instance are represented by similar embeddings. We then propose a multi-value conditional random field model to incorporate the semantic and instance labels and formulate the problem of semantic and instance segmentation as jointly optimising labels in the field model. The proposed method is thoroughly evaluated and compared with existing methods on different indoor scene datasets including S3DIS and SceneNN. Experimental results showed the robustness of the proposed joint semantic-instance segmentation scheme over its single components. Our method also achieved state-of-the-art performance on semantic segmentation.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.00699/full.md

## Figures

92 figures with captions in the complete paper: https://tomesphere.com/paper/1904.00699/full.md

## References

50 references — full list in the complete paper: https://tomesphere.com/paper/1904.00699/full.md

---
Source: https://tomesphere.com/paper/1904.00699