Data-Efficient Approach to Humanoid Control via Fine-Tuning a Pre-Trained GPT on Action Data
Siddharth Padmanabhan (1), Kazuki Miyazawa (1), Takato Horii (1),, Takayuki Nagai (1, 2) ((1) Osaka University, (2) The University of, Electro-Communications)

TL;DR
This paper introduces a data-efficient method for humanoid control by fine-tuning a pre-trained GPT model on motion data, enabling realistic and human-like movements with less training time and data.
Contribution
It presents a novel approach of using a pre-trained GPT model fine-tuned on smaller datasets for humanoid control, addressing data and training efficiency issues.
Findings
GPT can generate physically plausible humanoid motions
Fine-tuning on smaller datasets reduces training time
Model achieves realistic, human-like movements in physics simulations
Abstract
There are several challenges in developing a model for multi-tasking humanoid control. Reinforcement learning and imitation learning approaches are quite popular in this domain. However, there is a trade-off between the two. Reinforcement learning is not the best option for training a humanoid to perform multiple behaviors due to training time and model size, and imitation learning using kinematics data alone is not appropriate to realize the actual physics of the motion. Training models to perform multiple complex tasks take long training time due to high DoF and complexities of the movements. Although training models offline would be beneficial, another issue is the size of the dataset, usually being quite large to encapsulate multiple movements. There are few implementations of transformer-based models to control humanoid characters and predict their motion based on a large dataset…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReal-time simulation and control systems
