Mamba as a motion encoder for robotic imitation learning
Toshiaki Tsuji

TL;DR
This paper introduces Mamba as an effective motion encoder for robotic imitation learning, demonstrating its ability to compress sequential data and outperform Transformers in success rates despite higher estimation errors.
Contribution
The paper presents Mamba, a novel architecture that functions as an encoder for imitation learning, effectively capturing temporal dynamics and improving task success rates over existing models.
Findings
Mamba achieves higher success rates than Transformers in practical tasks.
Mamba effectively compresses sequential information while preserving essential dynamics.
Mamba can serve as a real-time motion generator with limited training data.
Abstract
Recent advancements in imitation learning, particularly with the integration of LLM techniques, are set to significantly improve robots' dexterity and adaptability. This paper proposes using Mamba, a state-of-the-art architecture with potential applications in LLMs, for robotic imitation learning, highlighting its ability to function as an encoder that effectively captures contextual information. By reducing the dimensionality of the state space, Mamba operates similarly to an autoencoder. It effectively compresses the sequential information into state variables while preserving the essential temporal dynamics necessary for accurate motion prediction. Experimental results in tasks such as cup placing and case loading demonstrate that despite exhibiting higher estimation errors, Mamba achieves superior success rates compared to Transformers in practical task execution. This performance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Locomotion and Control · Robotic Mechanisms and Dynamics · Robot Manipulation and Learning
