Ensemble of LSTMs and feature selection for human action prediction

Tomislav Petkovi\'c; Luka Petrovi\'c; Ivan Markovi\'c; Ivan; Petrovi\'c

arXiv:2101.05645·cs.RO·January 15, 2021

Ensemble of LSTMs and feature selection for human action prediction

Tomislav Petkovi\'c, Luka Petrovi\'c, Ivan Markovi\'c, Ivan, Petrovi\'c

PDF

Open Access

TL;DR

This paper introduces an ensemble of LSTM networks combined with feature selection to improve human action prediction in robotics, utilizing the comprehensive MoGaze dataset to predict object grasping and location visitation with enhanced accuracy.

Contribution

It presents a novel ensemble LSTM approach with feature selection for human action prediction, outperforming single-cue baselines on the MoGaze dataset.

Findings

01

LSTM ensemble slightly outperforms gaze baseline in object prediction

02

The model achieves better accuracy in macro location prediction

03

Without gaze, the LSTM model significantly outperforms single cue baselines

Abstract

As robots are becoming more and more ubiquitous in human environments, it will be necessary for robotic systems to better understand and predict human actions. However, this is not an easy task, at times not even for us humans, but based on a relatively structured set of possible actions, appropriate cues, and the right model, this problem can be computationally tackled. In this paper, we propose to use an ensemble of long-short term memory (LSTM) networks for human action prediction. To train and evaluate models, we used the MoGaze dataset - currently the most comprehensive dataset capturing poses of human joints and the human gaze. We have thoroughly analyzed the MoGaze dataset and selected a reduced set of cues for this task. Our model can predict (i) which of the labeled objects the human is going to grasp, and (ii) which of the macro locations the human is going to visit (such as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Gaze Tracking and Assistive Technology · Hand Gesture Recognition Systems