Segmentation and Recovery of Superquadric Models using Convolutional   Neural Networks

Jaka \v{S}ircelj; Tim Oblak; Klemen Grm; Uro\v{s} Petkovi\'c; Ale\v{s}; Jakli\v{c}; Peter Peer; Vitomir \v{S}truc; Franc Solina

arXiv:2001.10504·cs.CV·January 29, 2020·5 cites

Segmentation and Recovery of Superquadric Models using Convolutional Neural Networks

Jaka \v{S}ircelj, Tim Oblak, Klemen Grm, Uro\v{s} Petkovi\'c, Ale\v{s}, Jakli\v{c}, Peter Peer, Vitomir \v{S}truc, Franc Solina

PDF

Open Access

TL;DR

This paper introduces a two-stage CNN-based method for segmenting complex 3D depth scenes into superquadric models, enabling efficient and interpretable shape representation.

Contribution

The paper presents a novel CNN-based approach combining Mask R-CNN and a regressor for fast, accurate superquadric segmentation and modeling of 3D scenes.

Findings

01

Competitive performance on synthetic and real data

02

Faster decomposition compared to existing methods

03

Effective representation with few parameters

Abstract

In this paper we address the problem of representing 3D visual data with parameterized volumetric shape primitives. Specifically, we present a (two-stage) approach built around convolutional neural networks (CNNs) capable of segmenting complex depth scenes into the simpler geometric structures that can be represented with superquadric models. In the first stage, our approach uses a Mask RCNN model to identify superquadric-like structures in depth scenes and then fits superquadric models to the segmented structures using a specially designed CNN regressor. Using our approach we are able to describe complex structures with a small number of interpretable parameters. We evaluated the proposed approach on synthetic as well as real-world depth data and show that our solution does not only result in competitive performance in comparison to the state-of-the-art, but is able to decompose scenes…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · 3D Shape Modeling and Analysis · Human Pose and Action Recognition