Sound-Based Recognition of Touch Gestures and Emotions for Enhanced Human-Robot Interaction
Yuanbo Hou, Qiaoqiao Ren, Wenwu Wang, Dick Botteldooren

TL;DR
This paper introduces a lightweight audio-based model that recognizes touch gestures and emotional states during human-robot interactions, addressing privacy concerns and hardware limitations of humanoid robots.
Contribution
It presents a novel, small-scale audio-only recognition model for tactile gestures and emotions, suitable for privacy-preserving and resource-constrained robotic applications.
Findings
Effective recognition of arousal and valence in emotions
Accurate tactile gesture classification from sound data
Comparable performance to larger pretrained audio models
Abstract
Emotion recognition and touch gesture decoding are crucial for advancing human-robot interaction (HRI), especially in social environments where emotional cues and tactile perception play important roles. However, many humanoid robots, such as Pepper, Nao, and Furhat, lack full-body tactile skin, limiting their ability to engage in touch-based emotional and gesture interactions. In addition, vision-based emotion recognition methods usually face strict GDPR compliance challenges due to the need to collect personal facial data. To address these limitations and avoid privacy issues, this paper studies the potential of using the sounds produced by touching during HRI to recognise tactile gestures and classify emotions along the arousal and valence dimensions. Using a dataset of tactile gestures and emotional interactions from 28 participants with the humanoid robot Pepper, we design an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSocial Robot Interaction and HRI · Hand Gesture Recognition Systems
