# Towards Understanding Language through Perception in Situated   Human-Robot Interaction: From Word Grounding to Grammar Induction

**Authors:** Amir Aly, Tadahiro Taniguchi

arXiv: 1812.04840 · 2020-03-16

## TL;DR

This paper explores how robots can understand human language by grounding words in visual perception and inducing grammatical structures, enabling better comprehension of instructions during interaction.

## Contribution

It introduces methods for grounding parts of speech through perception and inducing CCG grammar, advancing robot language understanding capabilities.

## Key findings

- Grounding parts of speech via visual perception.
- Inducing CCG grammar for phrase understanding.
- Improved robot comprehension of human instructions.

## Abstract

Robots are widely collaborating with human users in diferent tasks that require high-level cognitive functions to make them able to discover the surrounding environment. A difcult challenge that we briefy highlight in this short paper is inferring the latent grammatical structure of language, which includes grounding parts of speech (e.g., verbs, nouns, adjectives, and prepositions) through visual perception, and induction of Combinatory Categorial Grammar (CCG) for phrases. This paves the way towards grounding phrases so as to make a robot able to understand human instructions appropriately during interaction.

---
Source: https://tomesphere.com/paper/1812.04840