Online Decision MetaMorphFormer: A Casual Transformer-Based Reinforcement Learning Framework of Universal Embodied Intelligence
Luo Ji, Runji Lin

TL;DR
The paper introduces Online Decision MetaMorphFormer (ODM), a transformer-based reinforcement learning framework that enables universal embodied intelligence through online learning, environment recognition, and action planning, demonstrating strong generalization in diverse tasks.
Contribution
It presents a novel online RL framework using transformers that supports self-awareness, environment recognition, and action planning for universal embodied agents.
Findings
ODM achieves rapid learning with pre-trained datasets.
Demonstrates strong generalization in zero-shot and few-shot environments.
Outperforms existing methods in online experiments.
Abstract
Interactive artificial intelligence in the motion control field is an interesting topic, especially when universal knowledge is adaptive to multiple tasks and universal environments. Despite there being increasing efforts in the field of Reinforcement Learning (RL) with the aid of transformers, most of them might be limited by the offline training pipeline, which prohibits exploration and generalization abilities. To address this limitation, we propose the framework of Online Decision MetaMorphFormer (ODM) which aims to achieve self-awareness, environment recognition, and action planning through a unified model architecture. Motivated by cognitive and behavioral psychology, an ODM agent is able to learn from others, recognize the world, and practice itself based on its own experience. ODM can also be applied to any arbitrary agent with a multi-joint body, located in different…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics
