In-Context Decision Transformer: Reinforcement Learning via Hierarchical   Chain-of-Thought

Sili Huang; Jifeng Hu; Hechang Chen; Lichao Sun; Bo Yang

arXiv:2405.20692·cs.LG·June 3, 2024

In-Context Decision Transformer: Reinforcement Learning via Hierarchical Chain-of-Thought

Sili Huang, Jifeng Hu, Hechang Chen, Lichao Sun, Bo Yang

PDF

Open Access 1 Repo

TL;DR

This paper introduces the In-Context Decision Transformer (IDT), a hierarchical approach that improves efficiency and performance in offline reinforcement learning by focusing on high-level decisions, enabling faster online evaluation.

Contribution

The paper proposes IDT, a hierarchical decision transformer that reduces sequence length and computational costs in in-context RL, achieving state-of-the-art results in long-horizon tasks.

Findings

01

IDT achieves 36x faster evaluation in D4RL benchmark.

02

IDT outperforms existing methods on long-horizon tasks.

03

Hierarchical structure improves efficiency and effectiveness.

Abstract

In-context learning is a promising approach for offline reinforcement learning (RL) to handle online tasks, which can be achieved by providing task prompts. Recent works demonstrated that in-context RL could emerge with self-improvement in a trial-and-error manner when treating RL tasks as an across-episodic sequential prediction problem. Despite the self-improvement not requiring gradient updates, current works still suffer from high computational costs when the across-episodic sequence increases with task horizons. To this end, we propose an In-context Decision Transformer (IDT) to achieve self-improvement in a high-level trial-and-error manner. Specifically, IDT is inspired by the efficient hierarchical structure of human decision-making and thus reconstructs the sequence to consist of high-level decisions instead of low-level actions that interact with environments. As one…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

silihuang-ai/idt
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMental Health Research Topics

MethodsAttention Is All You Need · Linear Layer · Byte Pair Encoding · Label Smoothing · Adam · Residual Connection · Position-Wise Feed-Forward Layer · Multi-Head Attention · Dropout · Dense Connections