Multichannel Robot Speech Recognition Database: MChRSR
José Novoa, Juan Pablo Escudero, Josué Fredes, Jorge Wuth, Rodrigo Mahu, Néstor Becerra Yoma

TL;DR
This paper introduces the MChRSR database, a multichannel speech dataset recorded in real robot interaction scenarios with various movements, aimed at improving robot speech recognition under noisy conditions.
Contribution
It presents a new multichannel speech database recorded in real HRI scenarios with robot movements, facilitating research on noise-robust robot speech recognition.
Findings
12 hours of multichannel data collected
Recorded in real HRI scenarios with robot movements
Re-recorded Aurora 4 in different movement conditions
Abstract
In real human-robot interaction (HRI) scenarios, speech recognition is a major challenge because of robot noise, background noise, and a time-varying acoustic channel. This document describes the procedure used to obtain the Multichannel Robot Speech Recognition Database (MChRSR). It is composed of 12 hours of multichannel evaluation data recorded in a real mobile HRI scenario. The database was recorded with a PR2 robot performing different translational and azimuthal movements. Accordingly, 16 evaluation sets were obtained by re-recording the clean set of the Aurora 4 database in different movement conditions.
Taxonomy
Topics: Social Robot Interaction and HRI · Robotics and Automated Systems · IoT-based Smart Home Systems
