Canonical Space Representation for 4D Panoptic Segmentation of Articulated Objects
Manuel Gomes, Bogdan Raducanu, Miguel Oliveira

TL;DR
This paper introduces a new dataset and a novel 4D panoptic segmentation framework for articulated objects, leveraging temporal dynamics and canonical space alignment to improve segmentation accuracy in dynamic scenarios.
Contribution
The paper presents Artic4D, a new dataset with 4D panoptic annotations, and CanonSeg4D, a framework that explicitly models temporal dynamics using canonical space for improved segmentation.
Findings
CanonSeg4D outperforms existing methods in complex scenarios
Temporal modeling improves segmentation consistency
Canonical alignment enhances part-level segmentation accuracy
Abstract
Articulated object perception presents significant challenges in computer vision, particularly because most existing methods ignore temporal dynamics despite the inherently dynamic nature of such objects. The use of 4D temporal data has not been thoroughly explored in articulated object perception and remains unexamined for panoptic segmentation. The lack of a benchmark dataset further hurt this field. To this end, we introduce Artic4D as a new dataset derived from PartNet Mobility and augmented with synthetic sensor data, featuring 4D panoptic annotations and articulation parameters. Building on this dataset, we propose CanonSeg4D, a novel 4D panoptic segmentation framework. This approach explicitly estimates per-frame offsets mapping observed object parts to a learned canonical space, thereby enhancing part-level segmentation. The framework employs this canonical representation to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Generative Adversarial Networks and Image Synthesis · Robot Manipulation and Learning
