Data-Efficient Approach to Humanoid Control via Fine-Tuning a   Pre-Trained GPT on Action Data

Siddharth Padmanabhan (1); Kazuki Miyazawa (1); Takato Horii (1),; Takayuki Nagai (1; 2) ((1) Osaka University; (2) The University of; Electro-Communications)

arXiv:2405.18695·cs.RO·June 18, 2024

Data-Efficient Approach to Humanoid Control via Fine-Tuning a Pre-Trained GPT on Action Data

Siddharth Padmanabhan (1), Kazuki Miyazawa (1), Takato Horii (1),, Takayuki Nagai (1, 2) ((1) Osaka University, (2) The University of, Electro-Communications)

PDF

Open Access

TL;DR

This paper introduces a data-efficient method for humanoid control by fine-tuning a pre-trained GPT model on motion data, enabling realistic and human-like movements with less training time and data.

Contribution

It presents a novel approach of using a pre-trained GPT model fine-tuned on smaller datasets for humanoid control, addressing data and training efficiency issues.

Findings

01

GPT can generate physically plausible humanoid motions

02

Fine-tuning on smaller datasets reduces training time

03

Model achieves realistic, human-like movements in physics simulations

Abstract

There are several challenges in developing a model for multi-tasking humanoid control. Reinforcement learning and imitation learning approaches are quite popular in this domain. However, there is a trade-off between the two. Reinforcement learning is not the best option for training a humanoid to perform multiple behaviors due to training time and model size, and imitation learning using kinematics data alone is not appropriate to realize the actual physics of the motion. Training models to perform multiple complex tasks take long training time due to high DoF and complexities of the movements. Although training models offline would be beneficial, another issue is the size of the dataset, usually being quite large to encapsulate multiple movements. There are few implementations of transformer-based models to control humanoid characters and predict their motion based on a large dataset…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReal-time simulation and control systems