# Beyond the Self: Using Grounded Affordances to Interpret and Describe   Others' Actions

**Authors:** Giovanni Saponaro, Lorenzo Jamone, Alexandre Bernardino, Giampiero, Salvi

arXiv: 1902.09705 · 2020-06-12

## TL;DR

This paper presents a developmental approach enabling robots to interpret and describe human actions by leveraging learned object affordances and their own experiences, facilitating social collaboration.

## Contribution

It introduces a method for robots to reuse experience to interpret human actions and generate scene descriptions, integrating affordance learning with action recognition.

## Key findings

- Model can predict effects of actions based on object properties
- Can revise beliefs about actions from observed effects
- Generates relevant scene descriptions using probabilistic inference

## Abstract

We propose a developmental approach that allows a robot to interpret and describe the actions of human agents by reusing previous experience. The robot first learns the association between words and object affordances by manipulating the objects in its environment. It then uses this information to learn a mapping between its own actions and those performed by a human in a shared environment. It finally fuses the information from these two models to interpret and describe human actions in light of its own experience. In our experiments, we show that the model can be used flexibly to do inference on different aspects of the scene. We can predict the effects of an action on the basis of object properties. We can revise the belief that a certain action occurred, given the observed effects of the human action. In an early action recognition fashion, we can anticipate the effects when the action has only been partially observed. By estimating the probability of words given the evidence and feeding them into a pre-defined grammar, we can generate relevant descriptions of the scene. We believe that this is a step towards providing robots with the fundamental skills to engage in social collaboration with humans.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1902.09705/full.md

## Figures

21 figures with captions in the complete paper: https://tomesphere.com/paper/1902.09705/full.md

## References

51 references — full list in the complete paper: https://tomesphere.com/paper/1902.09705/full.md

---
Source: https://tomesphere.com/paper/1902.09705