Solving Visual Object Ambiguities when Pointing: An Unsupervised   Learning Approach

Doreen Jirak; David Biertimpel; Matthias Kerzel; Stefan Wermter

arXiv:1912.06449·cs.CV·December 16, 2019

Solving Visual Object Ambiguities when Pointing: An Unsupervised Learning Approach

Doreen Jirak, David Biertimpel, Matthias Kerzel, Stefan Wermter

PDF

2 Repos

TL;DR

This paper presents an unsupervised learning method using a GWR network to accurately recognize pointing gestures and resolve object ambiguities in cluttered environments for improved human-robot interaction.

Contribution

It introduces a markerless, unsupervised approach with a GWR network for modeling pointing gestures, enhancing real-time recognition in complex scenes.

Findings

01

GWR model effectively learns pointing-object associations

02

Approach handles ambiguities in cluttered environments

03

Markerless detection method is easily reproducible

Abstract

Whenever we are addressing a specific object or refer to a certain spatial location, we are using referential or deictic gestures usually accompanied by some verbal description. Especially pointing gestures are necessary to dissolve ambiguities in a scene and they are of crucial importance when verbal communication may fail due to environmental conditions or when two persons simply do not speak the same language. With the currently increasing advances of humanoid robots and their future integration in domestic domains, the development of gesture interfaces complementing human-robot interaction scenarios is of substantial interest. The implementation of an intuitive gesture scenario is still challenging because both the pointing intention and the corresponding object have to be correctly recognized in real-time. The demand increases when considering pointing gestures in a cluttered…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.