CNS: Correspondence Encoded Neural Image Servo Policy
Anzhe Chen, Hongxiang Yu, Yue Wang, Rong Xiong

TL;DR
This paper introduces CNS, a graph neural network-based image servo policy that encodes keypoints and correspondence for high-precision, robust robotic positioning, demonstrating superior generalization and efficiency in simulation and real-world tests.
Contribution
The paper proposes a novel graph neural network architecture encoding keypoints and correspondence for image servo, enhancing generalization, precision, and efficiency in robotic control.
Findings
Achieves <0.3° and sub-millimeter accuracy
Operates at up to 40fps in real-time
Generalizes from simulation to real-world without fine-tuning
Abstract
Image servo is an indispensable technique in robotic applications that helps to achieve high precision positioning. The intermediate representation of image servo policy is important to sensor input abstraction and policy output guidance. Classical approaches achieve high precision but require clean keypoint correspondence, and suffer from limited convergence basin or weak feature error robustness. Recent learning-based methods achieve moderate precision and large convergence basin on specific scenes but face issues when generalizing to novel environments. In this paper, we encode keypoints and correspondence into a graph and use graph neural network as architecture of controller. This design utilizes both advantages: generalizable intermediate representation from keypoint correspondence and strong modeling ability from neural network. Other techniques including realistic data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGlioma Diagnosis and Treatment · Medical Image Segmentation Techniques · Advanced Neural Network Applications
