KITchen: A Real-World Benchmark and Dataset for 6D Object Pose Estimation in Kitchen Environments
Abdelrahman Younes, Tamim Asfour

TL;DR
KITchen introduces a comprehensive real-world dataset and benchmark for 6D object pose estimation in kitchen environments, addressing the gap between existing datasets and real-world robotic manipulation challenges.
Contribution
The paper presents a new dataset, benchmark, and semi-automated annotation pipeline tailored for 6D pose estimation in diverse kitchen settings, reflecting real-world robotic tasks.
Findings
Dataset includes 205k RGBD images of 111 objects in kitchens.
Semi-automated annotation pipeline reduces labeling effort.
Benchmark enables evaluation of pose estimation methods in realistic scenarios.
Abstract
Despite the recent progress on 6D object pose estimation methods for robotic grasping, a substantial performance gap persists between the capabilities of these methods on existing datasets and their efficacy in real-world grasping and mobile manipulation tasks, particularly when robots rely solely on their monocular egocentric field of view (FOV). Existing real-world datasets primarily focus on table-top grasping scenarios, where a robot arm is placed in a fixed position and the objects are centralized within the FOV of fixed external camera(s). Assessing performance on such datasets may not accurately reflect the challenges encountered in everyday grasping and mobile manipulation tasks within kitchen environments such as retrieving objects from higher shelves, sinks, dishwashers, ovens, refrigerators, or microwaves. To address this gap, we present KITchen, a novel benchmark designed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Hand Gesture Recognition Systems · Robot Manipulation and Learning
