A self-organizing neural network architecture for learning human-object   interactions

Luiza Mici; German I. Parisi; Stefan Wermter

arXiv:1710.01916·cs.NE·March 5, 2018

A self-organizing neural network architecture for learning human-object interactions

Luiza Mici, German I. Parisi, Stefan Wermter

PDF

TL;DR

This paper introduces a self-organizing neural network architecture that unsupervisedly learns to recognize human-object interactions from RGB-D videos, demonstrating competitive results and neurophysiological consistency.

Contribution

The novel hierarchical GWR-based architecture jointly learns body motion and object representations for interaction recognition without supervision.

Findings

01

Higher neural activation for congruent action-object pairs.

02

Competitive classification performance on benchmark datasets.

03

Unsupervised learning of action-object mappings.

Abstract

The visual recognition of transitive actions comprising human-object interactions is a key component for artificial systems operating in natural environments. This challenging task requires jointly the recognition of articulated body actions as well as the extraction of semantic elements from the scene such as the identity of the manipulated objects. In this paper, we present a self-organizing neural network for the recognition of human-object interactions from RGB-D videos. Our model consists of a hierarchy of Grow-When-Required (GWR) networks that learn prototypical representations of body motion patterns and objects, accounting for the development of action-object mappings in an unsupervised fashion. We report experimental results on a dataset of daily activities collected for the purpose of this study as well as on a publicly available benchmark dataset. In line with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.