That was not what I was aiming at! Differentiating human intent and outcome in a physically dynamic throwing task
Vidullan Surendran, Alan R. Wagner

TL;DR
This paper develops a dataset and models to distinguish human intent from actual outcomes in dynamic throwing tasks, improving intent recognition accuracy in human-robot collaboration.
Contribution
It introduces a new dataset of throwing actions, a novel facial reaction-based outcome prediction model, and an end-to-end pipeline for intent recognition in dynamic tasks.
Findings
Outcome prediction accuracy improved by 38% over previous methods.
Nearly half of the throws were mistakes, with 16% missing the target.
The models effectively differentiate between intent and outcome in dynamic scenarios.
Abstract
Recognising intent in collaborative human robot tasks can improve team performance and human perception of robots. Intent can differ from the observed outcome in the presence of mistakes which are likely in physically dynamic tasks. We created a dataset of 1227 throws of a ball at a target from 10 participants and observed that 47% of throws were mistakes with 16% completely missing the target. Our research leverages facial images capturing the person's reaction to the outcome of a throw to predict when the resulting throw is a mistake and then we determine the actual intent of the throw. The approach we propose for outcome prediction performs 38% better than the two-stream architecture used previously for this task on front-on videos. In addition, we propose a 1-D CNN model which is used in conjunction with priors learned from the frequency of mistakes to provide an end-to-end pipeline…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
