ReLIC: A Recipe for 64k Steps of In-Context Reinforcement Learning for Embodied AI
Ahmad Elawady, Gunjan Chhablani, Ram Ramrakhya, Karmesh Yadav, Dhruv, Batra, Zsolt Kira, Andrew Szot

TL;DR
ReLIC introduces a novel in-context reinforcement learning approach enabling embodied agents to adapt rapidly to new environments using extensive experience, novel policy updates, and long observation histories, outperforming existing meta-RL methods.
Contribution
The paper presents ReLIC, a new method combining large-scale RL, partial updates, and Sink-KV mechanisms for effective in-context learning in embodied AI.
Findings
ReLIC outperforms meta-RL baselines in unseen house navigation tasks.
ReLIC can perform few-shot imitation learning without expert demonstrations.
The combination of large-scale RL, partial updates, and Sink-KV is crucial for success.
Abstract
Intelligent embodied agents need to quickly adapt to new scenarios by integrating long histories of experience into decision-making. For instance, a robot in an unfamiliar house initially wouldn't know the locations of objects needed for tasks and might perform inefficiently. However, as it gathers more experience, it should learn the layout of its environment and remember where objects are, allowing it to complete new tasks more efficiently. To enable such rapid adaptation to new tasks, we present ReLIC, a new approach for in-context reinforcement learning (RL) for embodied agents. With ReLIC, agents are capable of adapting to new environments using 64,000 steps of in-context experience with full attention while being trained through self-generated experience via RL. We achieve this by proposing a novel policy update scheme for on-policy RL called "partial updates'' as well as a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics
MethodsSoftmax · Attention Is All You Need · ReLIC
