ImageManip: Image-based Robotic Manipulation with Affordance-guided Next   View Selection

Xiaoqi Li; Yanzi Wang; Yan Shen; Ponomarenko Iaroslav; Haoran Lu,; Qianxu Wang; Boshi An; Jiaming Liu; Hao Dong

arXiv:2310.09069·cs.RO·October 16, 2023·2 cites

ImageManip: Image-based Robotic Manipulation with Affordance-guided Next View Selection

Xiaoqi Li, Yanzi Wang, Yan Shen, Ponomarenko Iaroslav, Haoran Lu,, Qianxu Wang, Boshi An, Jiaming Liu, Hao Dong

PDF

Open Access

TL;DR

This paper introduces an image-based robotic manipulation framework that uses multi-view RGB images and inferred depth to improve manipulation of 3D objects, reducing reliance on costly point cloud data.

Contribution

The novel framework combines multi-view RGB imaging with depth inference and affordance-guided view selection for effective 3D object manipulation.

Findings

01

Effective multi-view depth and affordance map fusion

02

Improved manipulation accuracy over point cloud-based methods

03

Practical deployment demonstrated through real-world experiments

Abstract

In the realm of future home-assistant robots, 3D articulated object manipulation is essential for enabling robots to interact with their environment. Many existing studies make use of 3D point clouds as the primary input for manipulation policies. However, this approach encounters challenges due to data sparsity and the significant cost associated with acquiring point cloud data, which can limit its practicality. In contrast, RGB images offer high-resolution observations using cost effective devices but lack spatial 3D geometric information. To overcome these limitations, we present a novel image-based robotic manipulation framework. This framework is designed to capture multiple perspectives of the target object and infer depth information to complement its geometry. Initially, the system employs an eye-on-hand RGB camera to capture an overall view of the target object. It predicts the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Robotics and Sensor-Based Localization · Robot Manipulation and Learning