TL;DR
This paper develops an adaptive speech system for robots that improves intelligibility and user experience by adjusting to environmental acoustics and user needs, based on a neural prediction model.
Contribution
It introduces a novel neural prediction model for ambient noise and a convolutional neural network for adaptive speech parameter tuning in robots.
Findings
Adaptive speech improves intelligibility and user satisfaction.
Good acoustic quality enhances comprehension and experience.
Background noise significantly impacts speech recognition accuracy.
Abstract
Spoken language interaction is at the heart of interpersonal communication, and people flexibly adapt their speech to different individuals and environments. It is surprising that robots, and by extension other digital devices, are not equipped to adapt their speech and instead rely on fixed speech parameters, which often hinder comprehension by the user. We conducted a speech comprehension study involving 39 participants who were exposed to different environmental and contextual conditions. During the experiment, the robot articulated words using different vocal parameters, and the participants were tasked with both recognising the spoken words and rating their subjective impression of the robot's speech. The experiment's primary outcome shows that spaces with good acoustic quality positively correlate with intelligibility and user experience. However, increasing the distance between…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
