Cross-Episodic Curriculum for Transformer Agents

Lucy Xiaoyang Shi; Yunfan Jiang; Jake Grigsby; Linxi "Jim"; Fan; Yuke Zhu

arXiv:2310.08549·cs.LG·October 13, 2023·1 cites

Cross-Episodic Curriculum for Transformer Agents

Lucy Xiaoyang Shi, Yunfan Jiang, Jake Grigsby, Linxi "Jim", Fan, Yuke Zhu

PDF

Open Access 1 Video

TL;DR

The paper introduces Cross-Episodic Curriculum (CEC), a novel algorithm that enhances Transformer agents' learning efficiency and generalization by structuring cross-episodic experiences into a curriculum, demonstrated in multi-task reinforcement and imitation learning scenarios.

Contribution

This work presents CEC, a new curriculum learning method that leverages cross-episodic experiences within Transformers to improve learning and generalization across tasks and data qualities.

Findings

01

CEC improves policy performance in multi-task reinforcement learning.

02

CEC enhances generalization in imitation learning with mixed-quality data.

03

Code implementation is publicly available for further research.

Abstract

We present a new algorithm, Cross-Episodic Curriculum (CEC), to boost the learning efficiency and generalization of Transformer agents. Central to CEC is the placement of cross-episodic experiences into a Transformer's context, which forms the basis of a curriculum. By sequentially structuring online learning trials and mixed-quality demonstrations, CEC constructs curricula that encapsulate learning progression and proficiency increase across episodes. Such synergy combined with the potent pattern recognition capabilities of Transformer models delivers a powerful cross-episodic attention mechanism. The effectiveness of CEC is demonstrated under two representative scenarios: one involving multi-task reinforcement learning with discrete control, such as in DeepMind Lab, where the curriculum captures the learning progression in both individual and progressively complex settings; and the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Cross-Episodic Curriculum for Transformer Agents· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics

MethodsAttention Is All You Need · Dense Connections · Dropout · Byte Pair Encoding · Softmax · Layer Normalization · Linear Layer · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Label Smoothing