Segmentation and Recovery of Superquadric Models using Convolutional Neural Networks
Jaka \v{S}ircelj, Tim Oblak, Klemen Grm, Uro\v{s} Petkovi\'c, Ale\v{s}, Jakli\v{c}, Peter Peer, Vitomir \v{S}truc, Franc Solina

TL;DR
This paper introduces a two-stage CNN-based method for segmenting complex 3D depth scenes into superquadric models, enabling efficient and interpretable shape representation.
Contribution
The paper presents a novel CNN-based approach combining Mask R-CNN and a regressor for fast, accurate superquadric segmentation and modeling of 3D scenes.
Findings
Competitive performance on synthetic and real data
Faster decomposition compared to existing methods
Effective representation with few parameters
Abstract
In this paper we address the problem of representing 3D visual data with parameterized volumetric shape primitives. Specifically, we present a (two-stage) approach built around convolutional neural networks (CNNs) capable of segmenting complex depth scenes into the simpler geometric structures that can be represented with superquadric models. In the first stage, our approach uses a Mask RCNN model to identify superquadric-like structures in depth scenes and then fits superquadric models to the segmented structures using a specially designed CNN regressor. Using our approach we are able to describe complex structures with a small number of interpretable parameters. We evaluated the proposed approach on synthetic as well as real-world depth data and show that our solution does not only result in competitive performance in comparison to the state-of-the-art, but is able to decompose scenes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · 3D Shape Modeling and Analysis · Human Pose and Action Recognition
