TL;DR
This paper presents a real-time deep learning system for visual perception, grasp detection, and visual servo control in autonomous robotic manipulation, achieving high accuracy and generalization in dynamic environments.
Contribution
The work introduces a novel real-time neural network framework for combined grasp detection and visual servoing, trained on automatically generated datasets for improved accuracy and speed.
Findings
Achieved millimeter-level accuracy in grasp positioning.
Demonstrated real-time performance in dynamic environments.
Validated system effectiveness on a Kinova Gen3 robot.
Abstract
In order to explore robotic grasping in unstructured and dynamic environments, this work addresses the visual perception phase involved in the task. This phase involves the processing of visual data to obtain the location of the object to be grasped, its pose and the points at which the robot`s grippers must make contact to ensure a stable grasp. For this, the Cornell Grasping dataset is used to train a convolutional neural network that, having an image of the robot`s workspace, with a certain object, is able to predict a grasp rectangle that symbolizes the position, orientation and opening of the robot`s grippers before its closing. In addition to this network, which runs in real-time, another one is designed to deal with situations in which the object moves in the environment. Therefore, the second network is trained to perform a visual servo control, ensuring that the object remains…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
