HOI4ABOT: Human-Object Interaction Anticipation for Human Intention Reading Collaborative roBOTs
Esteve Valls Mascaro, Daniel Sliwowski, Dongheui Lee

TL;DR
This paper introduces HOI4ABOT, a transformer-based model that improves human-object interaction anticipation in videos, enabling robots to better understand and proactively assist humans in collaborative tasks.
Contribution
The paper presents a novel, efficient transformer model for HOI anticipation that outperforms existing methods in accuracy and speed, facilitating improved human-robot collaboration.
Findings
Outperforms state-of-the-art in HOI detection and anticipation
Increases mAP by 1.76% and 1.04% respectively
15.4 times faster than previous models
Abstract
Robots are becoming increasingly integrated into our lives, assisting us in various tasks. To ensure effective collaboration between humans and robots, it is essential that they understand our intentions and anticipate our actions. In this paper, we propose a Human-Object Interaction (HOI) anticipation framework for collaborative robots. We propose an efficient and robust transformer-based model to detect and anticipate HOIs from videos. This enhanced anticipation empowers robots to proactively assist humans, resulting in more efficient and intuitive collaborations. Our model outperforms state-of-the-art results in HOI detection and anticipation in VidHOI dataset with an increase of 1.76% and 1.04% in mAP respectively while being 15.4 times faster. We showcase the effectiveness of our approach through experimental results in a real robot, demonstrating that the robot's ability to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Human Pose and Action Recognition · Advanced Neural Network Applications
