Box6D : Zero-shot Category-level 6D Pose Estimation of Warehouse Boxes
Yintao Ma, Sajjad Pakdamansavoji, Amir Rasouli, Tongtong Cao

TL;DR
Box6D is a novel category-level 6D pose estimation method for warehouse boxes that uses a single RGB-D image, infers dimensions efficiently, and achieves high accuracy with significantly reduced inference time, suitable for industrial applications.
Contribution
It introduces a fast, category-specific 6D pose estimation approach that leverages a CAD template, dimension inference, and hypothesis rejection to improve efficiency and accuracy in warehouse scenarios.
Findings
Achieves competitive or superior 6D pose accuracy.
Reduces inference time by approximately 76%.
Effective in real-world storage scenarios and benchmarks.
Abstract
Accurate and efficient 6D pose estimation of novel objects under clutter and occlusion is critical for robotic manipulation across warehouse automation, bin picking, logistics, and e-commerce fulfillment. There are three main approaches in this domain; Model-based methods assume an exact CAD model at inference but require high-resolution meshes and transfer poorly to new environments; Model-free methods that rely on a few reference images or videos are more flexible, however often fail under challenging conditions; Category-level approaches aim to balance flexibility and accuracy but many are overly general and ignore environment and object priors, limiting their practicality in industrial settings. To this end, we propose Box6d, a category-level 6D pose estimation method tailored for storage boxes in the warehouse context. From a single RGB-D observation, Box6D infers the dimensions…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Robotics and Sensor-Based Localization · Soft Robotics and Applications
