TL;DR
This paper presents an unsupervised learning method using a GWR network to accurately recognize pointing gestures and resolve object ambiguities in cluttered environments for improved human-robot interaction.
Contribution
It introduces a markerless, unsupervised approach with a GWR network for modeling pointing gestures, enhancing real-time recognition in complex scenes.
Findings
GWR model effectively learns pointing-object associations
Approach handles ambiguities in cluttered environments
Markerless detection method is easily reproducible
Abstract
Whenever we are addressing a specific object or refer to a certain spatial location, we are using referential or deictic gestures usually accompanied by some verbal description. Especially pointing gestures are necessary to dissolve ambiguities in a scene and they are of crucial importance when verbal communication may fail due to environmental conditions or when two persons simply do not speak the same language. With the currently increasing advances of humanoid robots and their future integration in domestic domains, the development of gesture interfaces complementing human-robot interaction scenarios is of substantial interest. The implementation of an intuitive gesture scenario is still challenging because both the pointing intention and the corresponding object have to be correctly recognized in real-time. The demand increases when considering pointing gestures in a cluttered…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
