Ensemble of LSTMs and feature selection for human action prediction
Tomislav Petkovi\'c, Luka Petrovi\'c, Ivan Markovi\'c, Ivan, Petrovi\'c

TL;DR
This paper introduces an ensemble of LSTM networks combined with feature selection to improve human action prediction in robotics, utilizing the comprehensive MoGaze dataset to predict object grasping and location visitation with enhanced accuracy.
Contribution
It presents a novel ensemble LSTM approach with feature selection for human action prediction, outperforming single-cue baselines on the MoGaze dataset.
Findings
LSTM ensemble slightly outperforms gaze baseline in object prediction
The model achieves better accuracy in macro location prediction
Without gaze, the LSTM model significantly outperforms single cue baselines
Abstract
As robots are becoming more and more ubiquitous in human environments, it will be necessary for robotic systems to better understand and predict human actions. However, this is not an easy task, at times not even for us humans, but based on a relatively structured set of possible actions, appropriate cues, and the right model, this problem can be computationally tackled. In this paper, we propose to use an ensemble of long-short term memory (LSTM) networks for human action prediction. To train and evaluate models, we used the MoGaze dataset - currently the most comprehensive dataset capturing poses of human joints and the human gaze. We have thoroughly analyzed the MoGaze dataset and selected a reduced set of cues for this task. Our model can predict (i) which of the labeled objects the human is going to grasp, and (ii) which of the macro locations the human is going to visit (such as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Gaze Tracking and Assistive Technology · Hand Gesture Recognition Systems
