No More Mumbles: Enhancing Robot Intelligibility through Speech   Adaptation

Qiaoqiao Ren; Yuanbo Hou; Dick Botteldooren; and Tony Belpaeme

arXiv:2405.09708·cs.RO·May 17, 2024

No More Mumbles: Enhancing Robot Intelligibility through Speech Adaptation

Qiaoqiao Ren, Yuanbo Hou, Dick Botteldooren, and Tony Belpaeme

PDF

1 Repo

TL;DR

This paper develops an adaptive speech system for robots that improves intelligibility and user experience by adjusting to environmental acoustics and user needs, based on a neural prediction model.

Contribution

It introduces a novel neural prediction model for ambient noise and a convolutional neural network for adaptive speech parameter tuning in robots.

Findings

01

Adaptive speech improves intelligibility and user satisfaction.

02

Good acoustic quality enhances comprehension and experience.

03

Background noise significantly impacts speech recognition accuracy.

Abstract

Spoken language interaction is at the heart of interpersonal communication, and people flexibly adapt their speech to different individuals and environments. It is surprising that robots, and by extension other digital devices, are not equipped to adapt their speech and instead rely on fixed speech parameters, which often hinder comprehension by the user. We conducted a speech comprehension study involving 39 participants who were exposed to different environmental and contextual conditions. During the experiment, the robot articulated words using different vocal parameters, and the participants were tasked with both recognising the spoken words and rating their subjective impression of the robot's speech. The experiment's primary outcome shows that spaces with good acoustic quality positively correlate with intelligibility and user experience. However, increasing the distance between…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

qiaoqiao2323/robot-speech-intelligibility
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.