From Pixels to Torques with Linear Feedback

Jeong Hun Lee; Sam Schoedel; Aditya Bhardwaj; Zachary Manchester

arXiv:2406.18699·cs.RO·December 11, 2024

From Pixels to Torques with Linear Feedback

Jeong Hun Lee, Sam Schoedel, Aditya Bhardwaj, Zachary Manchester

PDF

Open Access

TL;DR

This paper introduces a simple, linear observer-based feedback approach for robotic control directly from raw camera images, demonstrating effectiveness in simulation and real hardware for stabilization and swing-up tasks.

Contribution

The paper presents a novel method to learn linear image-based observers and controllers from demonstration data, enabling direct pixels-to-torques control with stability guarantees.

Findings

01

Linear feedback policies perform well on cartpole tasks.

02

Method is robust to noise, model mismatch, and occlusions.

03

Koopman embedding extends the approach to nonlinear systems.

Abstract

We demonstrate the effectiveness of simple observer-based linear feedback policies for "pixels-to-torques" control of robotic systems using only a robot-facing camera. Specifically, we show that the matrices of an image-based Luenberger observer (linear state estimator) for a "student" output-feedback policy can be learned from demonstration data provided by a "teacher" state-feedback policy via simple linear-least-squares regression. The resulting linear output-feedback controller maps directly from high-dimensional raw images to torques while being amenable to the rich set of analytical tools from linear systems theory, allowing us to enforce closed-loop stability constraints in the learning problem. We also investigate a nonlinear extension of the method via the Koopman embedding. Finally, we demonstrate the surprising effectiveness of linear pixels-to-torques policies on a cartpole…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputability, Logic, AI Algorithms