Visual Generalized Coordinates
M. Seetha Ramaiah, Amitabha Mukerjee, Arindam Chakraborty, Sadbodh, Sharma

TL;DR
This paper introduces a novel visual approach to robot configuration space using images, enabling collision detection and path planning without prior knowledge of robot kinematics or environment, demonstrated on simulated and real robots.
Contribution
The paper proposes a new visual generalized coordinate system derived from images, forming a manifold approximated by tangent spaces, enabling visual inverse kinematics and collision detection.
Findings
The visual roadmap (VRM) effectively represents robot configurations in image space.
Collision detection and path planning are performed efficiently in O(n) and O(nlogn) time.
The approach works on both simulated and real robots, regardless of kinematic complexity.
Abstract
An open problem in robotics is that of using vision to identify a robot's own body and the world around it. Many models attempt to recover the traditional C-space parameters. Instead, we propose an alternative C-space by deriving generalized coordinates from images of the robot. We show that the space of such images is bijective to the motion space, so these images lie on a manifold homeomorphic to the canonical C-space. We now approximate this manifold as a set of neighbourhood tangent spaces that result in a graph, which we call the Visual Roadmap (VRM). Given a new robot image, we perform inverse kinematics visually by interpolating between nearby images in the image space. Obstacles are projected onto the VRM in time by superimposition of images, leading to the identification of collision poses. The edges joining the free nodes can now be checked with a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Robotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques
