TL;DR
This paper introduces behavioral exploration, a method for training agents to adapt and explore in real-time by learning from expert demonstrations, enabling fast online adaptation in robotics and manipulation tasks.
Contribution
The paper proposes a novel approach that trains a generative model to internalize exploration and adaptation from expert data, allowing agents to perform targeted, in-context exploration and rapid behavior adaptation.
Findings
Effective in simulated locomotion and manipulation tasks
Successful real-world robotic manipulation demonstrations
Enables fast online exploration and adaptation
Abstract
Developing autonomous agents that quickly explore an environment and adapt their behavior online is a canonical challenge in robotics and machine learning. While humans are able to achieve such fast online exploration and adaptation, often acquiring new information and skills in only a handful of interactions, existing algorithmic approaches tend to rely on random exploration and slow, gradient-based behavior updates. How can we endow autonomous agents with such capabilities on par with humans? Taking inspiration from recent progress on both in-context learning and large-scale behavioral cloning, in this work we propose behavioral exploration: training agents to internalize what it means to explore and adapt in-context over the space of ``expert'' behaviors. To achieve this, given access to a dataset of expert demonstrations, we train a long-context generative model to predict expert…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
