Learning Latent Part-Whole Hierarchies for Point Clouds
Xiang Gao, Wei Hu, Renjie Liao

TL;DR
This paper introduces a weakly supervised latent variable model that explicitly learns part-whole hierarchies in 3D point clouds, improving segmentation accuracy at multiple levels.
Contribution
It proposes a novel encoder-decoder framework with discrete latent variables for hierarchical 3D point cloud segmentation, trained with weak supervision.
Findings
Achieves state-of-the-art top-level part segmentation results.
Effectively segments middle-level latent subparts.
Demonstrates robustness with different inference algorithms.
Abstract
Strong evidence suggests that humans perceive the 3D world by parsing visual scenes and objects into part-whole hierarchies. Although deep neural networks have the capability of learning powerful multi-level representations, they can not explicitly model part-whole hierarchies, which limits their expressiveness and interpretability in processing 3D vision data such as point clouds. To this end, we propose an encoder-decoder style latent variable model that explicitly learns the part-whole hierarchies for the multi-level point cloud segmentation. Specifically, the encoder takes a point cloud as input and predicts the per-point latent subpart distribution at the middle level. The decoder takes the latent variable and the feature from the encoder as an input and predicts the per-point part distribution at the top level. During training, only annotated part labels at the top level are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · 3D Surveying and Cultural Heritage · Advanced Neural Network Applications
MethodsREINFORCE
