A Robotic Visual Grasping Design: Rethinking Convolution Neural Network with High-Resolutions
Zhangli Zhou, Shaochen Wang, Ziyang Chen, Mingyu Cai, Zhen Kan

TL;DR
This paper introduces HRG-Net, a high-resolution, parallel-branch CNN architecture for robotic visual grasping that maintains spatial details and improves accuracy over traditional low-resolution encoding methods.
Contribution
The paper proposes a novel parallel-branch CNN design, HRG-Net, for robotic grasping that preserves high-resolution features and enhances performance compared to existing approaches.
Findings
HRG-Net improves grasping accuracy in real environments.
The parallel-branch design accelerates network training.
Maintaining high-resolution representations benefits robotic perception.
Abstract
High-resolution representations are important for vision-based robotic grasping problems. Existing works generally encode the input images into low-resolution representations via sub-networks and then recover high-resolution representations. This will lose spatial information, and errors introduced by the decoder will be more serious when multiple types of objects are considered or objects are far away from the camera. To address these issues, we revisit the design paradigm of CNN for robotic perception tasks. We demonstrate that using parallel branches as opposed to serial stacked convolutional layers will be a more powerful design for robotic visual grasping tasks. In particular, guidelines of neural network design are provided for robotic perception tasks, e.g., high-resolution representation and lightweight design, which respond to the challenges in different manipulation scenarios.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Robot Manipulation and Learning · Advanced Neural Network Applications
