APEX: Unsupervised, Object-Centric Scene Segmentation and Tracking for Robot Manipulation
Yizhe Wu, Oiwi Parker Jones, Martin Engelcke, and Ingmar Posner

TL;DR
APEX is a novel unsupervised object segmentation and tracking model that effectively handles complex scenes in robot manipulation, outperforming current methods and enabling improved robot skill execution.
Contribution
We introduce APEX, a new latent-variable model with a mask normalization algorithm and high-resolution encoder for better scene understanding in robotics.
Findings
APEX outperforms state-of-the-art in unsupervised segmentation and tracking.
APEX achieves high accuracy on real-world and simulated datasets.
Segmentations improve robot manipulation performance.
Abstract
Recent advances in unsupervised learning for object detection, segmentation, and tracking hold significant promise for applications in robotics. A common approach is to frame these tasks as inference in probabilistic latent-variable models. In this paper, however, we show that the current state-of-the-art struggles with visually complex scenes such as typically encountered in robot manipulation tasks. We propose APEX, a new latent-variable model which is able to segment and track objects in more realistic scenes featuring objects that vary widely in size and texture, including the robot arm itself. This is achieved by a principled mask normalisation algorithm and a high-resolution scene encoder. To evaluate our approach, we present results on the real-world Sketchy dataset. This dataset, however, does not contain ground truth masks and object IDs for a quantitative evaluation. We thus…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Domain Adaptation and Few-Shot Learning · Video Surveillance and Tracking Methods
