Egocentric Visual Self-Modeling for Autonomous Robot Dynamics Prediction   and Adaptation

Yuhang Hu; Boyuan Chen; Hod Lipson

arXiv:2207.03386·cs.RO·March 19, 2024

Egocentric Visual Self-Modeling for Autonomous Robot Dynamics Prediction and Adaptation

Yuhang Hu, Boyuan Chen, Hod Lipson

PDF

Open Access

TL;DR

This paper introduces a novel egocentric visual self-model for robots that learns their dynamics from a single camera in a self-supervised way, enabling autonomous detection of damage and adaptation without prior knowledge.

Contribution

It presents the first task-agnostic visual self-model learned solely from first-person camera input, capable of detecting damage and adapting behavior across different robot configurations.

Findings

01

Successfully learned dynamic self-model from visual input

02

Enabled autonomous damage detection and adaptation

03

Validated generalizability across multiple robot types

Abstract

The ability of robots to model their own dynamics is key to autonomous planning and learning, as well as for autonomous damage detection and recovery. Traditionally, dynamic models are pre-programmed or learned from external observations. Here, we demonstrate for the first time how a task-agnostic dynamic self-model can be learned using only a single first-person-view camera in a self-supervised manner, without any prior knowledge of robot morphology, kinematics, or task. Through experiments on a 12-DoF robot, we demonstrate the capabilities of the model in basic locomotion tasks using visual input. Notably, the robot can autonomously detect anomalies, such as damaged components, and adapt its behavior, showcasing resilience in dynamic environments. Furthermore, the model's generalizability was validated across robots with different configurations, emphasizing its potential as a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · Advanced Vision and Imaging · Robotic Locomotion and Control