A Mixture of Experts Approach to 3D Human Motion Prediction

Edmund Shieh; Joshua Lee Franco; Kang Min Bae; Tej Lalvani

arXiv:2405.06088·cs.CV·May 13, 2024

Open Access · 1 Repo

TL;DR

This paper evaluates existing 3D human motion prediction models, replicates a state-of-the-art transformer, and introduces a Mixture of Experts architecture within the attention layer to improve real-time inference efficiency.

Contribution

It proposes a novel Soft Mixture of Experts (Soft MoE) layer integrated into a spatio-temporal transformer for faster, more scalable human motion prediction.
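As a rough illustration of the idea, the sketch below implements a generic Soft MoE layer in NumPy, following the general formulation in Puigcerver et al., "From Sparse to Soft Mixtures of Experts". It is not the authors' code: the official repository is in PyTorch, and this page does not say exactly where in the attention layer the experts sit. The names `soft_moe`, `make_expert`, `phi`, and `slots_per_expert`, and the tiny tanh experts, are assumptions made for the example.

```python
import numpy as np

def softmax(z, axis):
    """Numerically stable softmax along the given axis."""
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def soft_moe(x, phi, experts, slots_per_expert):
    """Generic Soft MoE layer (illustrative only, not the paper's implementation).

    x:       (n_tokens, d) input tokens
    phi:     (d, n_experts * slots_per_expert) slot parameters (learned in practice)
    experts: list of callables, each mapping (slots_per_expert, d) -> (slots_per_expert, d)
    """
    logits = x @ phi                         # (n_tokens, n_slots)
    dispatch = softmax(logits, axis=0)       # normalise over tokens, per slot
    combine = softmax(logits, axis=1)        # normalise over slots, per token
    slots = dispatch.T @ x                   # each slot is a weighted mix of all tokens
    outputs = [
        expert(slots[i * slots_per_expert:(i + 1) * slots_per_expert])
        for i, expert in enumerate(experts)  # expert i only sees its own slots
    ]
    slot_out = np.concatenate(outputs, axis=0)
    return combine @ slot_out                # each token is a weighted mix of slot outputs

def make_expert(rng, d):
    """Hypothetical one-layer expert with its own random weights."""
    w = rng.normal(scale=0.1, size=(d, d))
    return lambda s: np.tanh(s @ w)

rng = np.random.default_rng(0)
n_tokens, d, n_experts, slots_per_expert = 6, 4, 3, 2
x = rng.normal(size=(n_tokens, d))
phi = rng.normal(size=(d, n_experts * slots_per_expert))
experts = [make_expert(rng, d) for _ in range(n_experts)]

y = soft_moe(x, phi, experts, slots_per_expert)
print(y.shape)  # one output vector per input token
```

Because every token contributes to every slot through soft weights, the layer stays fully differentiable and needs no discrete routing, while each expert processes a fixed number of slots regardless of sequence length.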

Findings

01 Soft MoE improves inference speed

02 Replicated SOTA transformer performance

03 Code is publicly available

Abstract

This project addresses the challenge of human motion prediction, a critical area for applications such as autonomous vehicle movement detection. Previous works have emphasized the need for low inference times to provide real-time performance for applications like these. Our primary objective is to critically evaluate existing model architectures, identifying their advantages and opportunities for improvement by replicating the state-of-the-art (SOTA) Spatio-Temporal Transformer model as closely as possible given computational constraints. These models have surpassed the limitations of RNN-based models and have demonstrated the ability to generate plausible motion sequences over both short- and long-term horizons through the use of spatio-temporal representations. We also propose a novel architecture to address challenges of real-time inference speed by incorporating a Mixture of…

Code & Models

Repositories

edshieh/motionprediction
PyTorch · Official

Taxonomy

Topics: Human Pose and Action Recognition · Gait Recognition and Analysis · Video Surveillance and Tracking Methods

Methods: Attention Is All You Need · Cosine Annealing · Attention Dropout · Position-Wise Feed-Forward Layer · Dropout · Linear Warmup With Cosine Annealing · Label Smoothing · Residual Connection · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings