Robot Localisation and 3D Position Estimation Using a Free-Moving Camera and Cascaded Convolutional Neural Networks
Justinas Miseikis, Patrick Knobelreiter, Inka Brijacak, Saeed, Yahyanejad, Kyrre Glette, Ole Jakob Elle, Jim Torresen

TL;DR
This paper introduces a deep learning approach using cascaded CNNs for robot localization and 3D joint estimation from 2D images, enabling flexible sensor placement without fixed calibration.
Contribution
It presents a novel method that allows a moving camera to localize and estimate robot joints in 3D without prior calibration, expanding application flexibility.
Findings
The system accurately localizes robot position in 2D images.
It estimates robot joint positions in 3D with promising accuracy.
The method enables a moving camera to be used effectively for robot tracking.
Abstract
Many works in collaborative robotics and human-robot interaction focuses on identifying and predicting human behaviour while considering the information about the robot itself as given. This can be the case when sensors and the robot are calibrated in relation to each other and often the reconfiguration of the system is not possible, or extra manual work is required. We present a deep learning based approach to remove the constraint of having the need for the robot and the vision sensor to be fixed and calibrated in relation to each other. The system learns the visual cues of the robot body and is able to localise it, as well as estimate the position of robot joints in 3D space by just using a 2D color image. The method uses a cascaded convolutional neural network, and we present the structure of the network, describe our own collected dataset, explain the network training and achieved…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
