Virtual avatar generation models as world navigators

Sai Mandava

arXiv:2406.01056 · cs.CV · June 4, 2024

Open Access

TL;DR

This paper presents SABR-CLIMB, a diffusion transformer model that generates realistic human movement in climbing environments using a large dataset, aiming to advance virtual avatar applications in robotics, sports, and healthcare.

Contribution

Introduction of SABR-CLIMB, a diffusion transformer model for virtual avatar generation trained on a large proprietary dataset for complex movement tasks.

Findings

01 Successful simulation of human climbing movements

02 Effective use of the large proprietary dataset NAV-22M

03 Potential applications in robotics, sports, and healthcare

Abstract

We introduce SABR-CLIMB, a novel video model simulating human movement in rock climbing environments using a virtual avatar. Our diffusion transformer predicts the sample instead of noise in each diffusion step and ingests entire videos to output complete motion sequences. By leveraging a large proprietary dataset, NAV-22M, and substantial computational resources, we showcase a proof of concept for a system to train general-purpose virtual avatars for complex tasks in robotics, sports, and healthcare.
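The abstract's distinction between predicting the sample and predicting the noise can be illustrated with a small sketch. It assumes the standard DDPM forward process with a linear beta schedule; the schedule, the clip shape, and every function name below are illustrative assumptions, not details taken from the paper. The sketch shows only how the regression target differs between the two parameterizations and how one prediction can be converted into the other.

```python
import numpy as np


def make_alpha_bar(num_steps=1000, beta_start=1e-4, beta_end=0.02):
    """Cumulative signal fraction for an assumed linear beta schedule."""
    betas = np.linspace(beta_start, beta_end, num_steps)
    return np.cumprod(1.0 - betas)


def add_noise(x0, eps, alpha_bar_t):
    """Forward diffusion: mix the clean sample with Gaussian noise."""
    return np.sqrt(alpha_bar_t) * x0 + np.sqrt(1.0 - alpha_bar_t) * eps


def training_target(x0, eps, predict="sample"):
    """Return what the network regresses onto at a diffusion step.

    "sample": the clean input x0 (the choice the abstract describes).
    "noise":  the injected noise eps (the more common alternative).
    """
    if predict == "sample":
        return x0
    if predict == "noise":
        return eps
    raise ValueError(f"unknown prediction type: {predict!r}")


def noise_from_sample(x_t, x0_hat, alpha_bar_t):
    """Convert a sample prediction into the equivalent noise prediction."""
    return (x_t - np.sqrt(alpha_bar_t) * x0_hat) / np.sqrt(1.0 - alpha_bar_t)


rng = np.random.default_rng(0)
# Hypothetical clip: 8 frames of 16x16 RGB (shape chosen for illustration).
video = rng.standard_normal((8, 16, 16, 3))
eps = rng.standard_normal(video.shape)

alpha_bar = make_alpha_bar()
t = 500
x_t = add_noise(video, eps, alpha_bar[t])

# With a perfect sample prediction, the implied noise matches the true noise.
recovered_eps = noise_from_sample(x_t, video, alpha_bar[t])
```

Because the two parameterizations are related by this algebraic identity, a sampler written for noise predictions can in principle consume a sample-predicting model after conversion; whether SABR-CLIMB does so is not stated in the abstract.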

Taxonomy

Topics: Human Motion and Animation

Methods: Diffusion