Planning from Observation and Interaction

Tyler Han; Siyang Shen; Rohan Baijal; Harine Ravichandiran; Bat Nemekhbold; Kevin Huang; Sanghun Jung; Byron Boots

arXiv:2602.24121·cs.RO·March 2, 2026

Planning from Observation and Interaction

Tyler Han, Siyang Shen, Rohan Baijal, Harine Ravichandiran, Bat Nemekhbold, Kevin Huang, Sanghun Jung, Byron Boots

PDF

Open Access

TL;DR

This paper introduces a planning-based IRL algorithm enabling real-world robot learning from observation and interaction alone, achieving efficient, scratch learning of manipulation tasks without prior data or pre-training.

Contribution

It presents a novel IRL-based world modeling approach that operates solely on observations and interaction, demonstrating effective real-world manipulation learning from scratch.

Findings

01

Learned world model enables real-time transfer learning in robots.

02

Achieves manipulation tasks in under an hour without prior data.

03

Outperforms existing IRL, RL, and BC methods in sample efficiency.

Abstract

Observational learning requires an agent to learn to perform a task by referencing only observations of the performed task. This work investigates the equivalent setting in real-world robot learning where access to hand-designed rewards and demonstrator actions are not assumed. To address this data-constrained setting, this work presents a planning-based Inverse Reinforcement Learning (IRL) algorithm for world modeling from observation and interaction alone. Experiments conducted entirely in the real-world demonstrate that this paradigm is effective for learning image-based manipulation tasks from scratch in under an hour, without assuming prior knowledge, pre-training, or data of any kind beyond task observations. Moreover, this work demonstrates that the learned world model representation is capable of online transfer learning in the real-world from scratch. In comparison to existing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Multimodal Machine Learning Applications