Temporal Representation Alignment: Successor Features Enable Emergent Compositionality in Robot Instruction Following

Vivek Myers; Bill Chunyuan Zheng; Anca Dragan; Kuan Fang; Sergey Levine

arXiv:2502.05454·cs.RO·February 14, 2025

Open Access

TL;DR

This paper introduces a temporal alignment loss for learning task representations that let robots generalize compositionally to new multi-step tasks without explicit subtask planning or reinforcement learning.

Contribution

The paper demonstrates that temporal representation alignment improves compositional generalization in robotic tasks, even without explicit subtask planning.

Findings

01

Significant improvement in compositional generalization across robotic tasks.

02

Effective for tasks specified with either language or goal images.

03

Applicable in both real robotic manipulation and simulation environments.

Abstract

Effective task representations should facilitate compositionality, such that after learning a variety of basic tasks, an agent can perform compound tasks consisting of multiple steps simply by composing the representations of the constituent steps together. While this is conceptually simple and appealing, it is not clear how to automatically learn representations that enable this sort of compositionality. We show that learning to associate the representations of current and future states with a temporal alignment loss can improve compositional generalization, even in the absence of any explicit subtask planning or reinforcement learning. We evaluate our approach across diverse robotic manipulation tasks as well as in simulation, showing substantial improvements for tasks specified with either language or goal images.
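
The abstract does not spell out the form of the temporal alignment loss. As a rough illustration only, the sketch below assumes a symmetric InfoNCE-style contrastive objective, which is one common way to associate the representation of a current state with that of a future state from the same trajectory. The function name `temporal_alignment_loss`, the `temperature` value, and the random toy data are all hypothetical and are not taken from the paper.

```python
import numpy as np


def log_softmax(x, axis):
    # Numerically stable log-softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    return x - np.log(np.exp(x).sum(axis=axis, keepdims=True))


def temporal_alignment_loss(phi_current, psi_future, temperature=0.1):
    """Symmetric InfoNCE loss (an assumed form, not the paper's definition).

    Row i of phi_current and row i of psi_future are treated as a positive
    pair (a state and a later state from the same trajectory); every other
    row in the batch serves as a negative.
    """
    # L2-normalize so the logits are scaled cosine similarities.
    phi = phi_current / np.linalg.norm(phi_current, axis=1, keepdims=True)
    psi = psi_future / np.linalg.norm(psi_future, axis=1, keepdims=True)
    logits = phi @ psi.T / temperature
    idx = np.arange(logits.shape[0])
    # Cross-entropy in both directions: current -> future and future -> current.
    loss_c2f = -log_softmax(logits, axis=1)[idx, idx].mean()
    loss_f2c = -log_softmax(logits, axis=0)[idx, idx].mean()
    return 0.5 * (loss_c2f + loss_f2c)


# Toy check with random vectors standing in for encoder outputs.
rng = np.random.default_rng(0)
future = rng.normal(size=(8, 16))
aligned = future + 0.01 * rng.normal(size=(8, 16))  # nearly matching pairs
shuffled = aligned[::-1].copy()                     # deliberately mismatched pairs

loss_aligned = temporal_alignment_loss(aligned, future)
loss_shuffled = temporal_alignment_loss(shuffled, future)
print(loss_aligned < loss_shuffled)
```

In this toy setup the loss is lower when each current-state vector is paired with its own future-state vector than when the pairs are mismatched, which is the behavior an alignment objective of this kind is meant to reward.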

Taxonomy

Topics: Reinforcement Learning in Robotics · Teaching and Learning Programming · Social Robot Interaction and HRI