IOSG: Image-driven Object Searching and Grasping
Houjian Yu, Xibai Lou, Yang Yang, and Changhyun Choi

TL;DR
This paper introduces IOSG, a robot system that uses image-driven methods and hierarchical policies to search for and grasp novel objects in cluttered environments, achieving high success rates in simulation and real-world tests.
Contribution
The paper presents a novel hierarchical policy framework with a Target Similarity Network for image-driven object search and grasping, trained with self-supervision in simulation.
Findings
Achieves 96.0% success in simulation coordination tasks
Achieves 94.5% success in simulation exploration tasks
Achieves 85.0% success rate on real robot tasks
Abstract
When robots retrieve specific objects from cluttered scenes, such as home and warehouse environments, the target objects are often partially occluded or completely hidden. Robots are thus required to search, identify a target object, and successfully grasp it. Preceding works have relied on pre-trained object recognition or segmentation models to find the target object. However, such methods require laborious manual annotations to train the models and even fail to find novel target objects. In this paper, we propose an Image-driven Object Searching and Grasping (IOSG) approach where a robot is provided with the reference image of a novel target object and tasked to find and retrieve it. We design a Target Similarity Network that generates a probability map to infer the location of the novel target. IOSG learns a hierarchical policy; the high-level policy predicts the subtask type,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning
