ARTiS: Appearance-based Action Recognition in Task Space for Real-Time Human-Robot Collaboration
Markus Eich, Sareh Shirazi, Gordon Wyeth

TL;DR
This paper introduces ARTiS, a real-time appearance-based action recognition method for human-robot collaboration that uses visual place recognition techniques to identify human actions without extensive prior knowledge.
Contribution
It reframes action recognition as a visual place recognition problem, enabling one-shot learning of human actions in a task space for improved robot support.
Findings
Effective recognition of human actions in real-time
Successful application to IKEA assembly tasks
Outperforms traditional methods in one-shot learning
Abstract
To have a robot actively supporting a human during a collaborative task, it is crucial that robots are able to identify the current action in order to predict the next one. Common approaches make use of high-level knowledge, such as object affordances, semantics or understanding of actions in terms of pre- and post-conditions. These approaches often require hand-coded a priori knowledge, time- and resource-intensive or supervised learning techniques. We propose to reframe this problem as an appearance-based place recognition problem. In our framework, we regard sequences of visual images of human actions as a map in analogy to the visual place recognition problem. Observing the task for the second time, our approach is able to recognize pre-observed actions in a one-shot learning approach and is thereby able to recognize the current observation in the task space. We propose two new…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Hand Gesture Recognition Systems · Video Surveillance and Tracking Methods
