Decision Mamba Architectures

Andr\'e Correia; Lu\'is A. Alexandre

arXiv:2405.07943·cs.LG·October 18, 2024

Decision Mamba Architectures

Andr\'e Correia, Lu\'is A. Alexandre

PDF

Open Access 2 Repos

TL;DR

This paper introduces Decision Mamba and Hierarchical Decision Mamba architectures, which enhance Transformer-based imitation learning models, outperforming existing methods across various environments and demonstrating significant improvements in task performance.

Contribution

The paper presents two novel Mamba architectures that improve upon Transformer models for imitation learning, with extensive experiments validating their effectiveness.

Findings

01

Mamba models outperform Transformers in most tasks

02

Decision Mamba outperforms other methods in diverse settings

03

Extensive experiments across OpenAI Gym and D4RL environments

Abstract

Recent advancements in imitation learning have been largely fueled by the integration of sequence models, which provide a structured flow of information to effectively mimic task behaviours. Currently, Decision Transformer (DT) and subsequently, the Hierarchical Decision Transformer (HDT), presented Transformer-based approaches to learn task policies. Recently, the Mamba architecture has shown to outperform Transformers across various task domains. In this work, we introduce two novel methods, Decision Mamba (DM) and Hierarchical Decision Mamba (HDM), aimed at enhancing the performance of the Transformer models. Through extensive experimentation across diverse environments such as OpenAI Gym and D4RL, leveraging varying demonstration data sets, we demonstrate the superiority of Mamba models over their Transformer counterparts in a majority of tasks. Results show that DM outperforms…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications

MethodsAttention Is All You Need · Linear Layer · Multi-Head Attention · Dense Connections · Position-Wise Feed-Forward Layer · Dropout · Label Smoothing · Residual Connection · Absolute Position Encodings · Byte Pair Encoding