HOIMotion: Forecasting Human Motion During Human-Object Interactions   Using Egocentric 3D Object Bounding Boxes

Zhiming Hu; Zheming Yin; Daniel Haeufle; Syn Schmitt; Andreas Bulling

arXiv:2407.02633·cs.CV·July 4, 2024

HOIMotion: Forecasting Human Motion During Human-Object Interactions Using Egocentric 3D Object Bounding Boxes

Zhiming Hu, Zheming Yin, Daniel Haeufle, Syn Schmitt, Andreas Bulling

PDF

Open Access

TL;DR

HOIMotion is a new method that improves human motion forecasting during human-object interactions by integrating egocentric 3D object bounding boxes with past body poses, outperforming existing methods.

Contribution

The paper introduces HOIMotion, which fuses pose and object features using graph convolutional networks for enhanced motion prediction during interactions.

Findings

01

Outperforms state-of-the-art by up to 8.7% on ADT dataset

02

Achieves up to 7.2% improvement on MoGaze dataset

03

Forecasted poses are perceived as more precise and realistic

Abstract

We present HOIMotion - a novel approach for human motion forecasting during human-object interactions that integrates information about past body poses and egocentric 3D object bounding boxes. Human motion forecasting is important in many augmented reality applications but most existing methods have only used past body poses to predict future motion. HOIMotion first uses an encoder-residual graph convolutional network (GCN) and multi-layer perceptrons to extract features from body poses and egocentric 3D object bounding boxes, respectively. Our method then fuses pose and object features into a novel pose-object graph and uses a residual-decoder GCN to forecast future body motion. We extensively evaluate our method on the Aria digital twin (ADT) and MoGaze datasets and show that HOIMotion consistently outperforms state-of-the-art methods by a large margin of up to 8.7% on ADT and 7.2% on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Motion and Animation

MethodsGraph Convolutional Network · Adaptive Richard's Curve Weighted Activation