HOIMotion: Forecasting Human Motion During Human-Object Interactions Using Egocentric 3D Object Bounding Boxes
Zhiming Hu, Zheming Yin, Daniel Haeufle, Syn Schmitt, Andreas Bulling

TL;DR
HOIMotion is a new method that improves human motion forecasting during human-object interactions by integrating egocentric 3D object bounding boxes with past body poses, outperforming existing methods.
Contribution
The paper introduces HOIMotion, which fuses pose and object features using graph convolutional networks for enhanced motion prediction during interactions.
Findings
Outperforms state-of-the-art by up to 8.7% on ADT dataset
Achieves up to 7.2% improvement on MoGaze dataset
Forecasted poses are perceived as more precise and realistic
Abstract
We present HOIMotion - a novel approach for human motion forecasting during human-object interactions that integrates information about past body poses and egocentric 3D object bounding boxes. Human motion forecasting is important in many augmented reality applications but most existing methods have only used past body poses to predict future motion. HOIMotion first uses an encoder-residual graph convolutional network (GCN) and multi-layer perceptrons to extract features from body poses and egocentric 3D object bounding boxes, respectively. Our method then fuses pose and object features into a novel pose-object graph and uses a residual-decoder GCN to forecast future body motion. We extensively evaluate our method on the Aria digital twin (ADT) and MoGaze datasets and show that HOIMotion consistently outperforms state-of-the-art methods by a large margin of up to 8.7% on ADT and 7.2% on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation
MethodsGraph Convolutional Network · Adaptive Richard's Curve Weighted Activation
