Multichannel Robot Speech Recognition Database: MChRSR

José Novoa; Juan Pablo Escudero; Josué Fredes; Jorge Wuth; Rodrigo Mahu; Néstor Becerra Yoma

arXiv:1801.00061 · cs.HC · January 3, 2018 · 1 citation

TL;DR

This paper introduces the MChRSR database, a multichannel speech dataset recorded in real robot interaction scenarios with various movements, aimed at improving robot speech recognition under noisy conditions.

Contribution

It presents a new multichannel speech database recorded in real HRI scenarios with robot movements, facilitating research on noise-robust robot speech recognition.

Findings

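01. 12 hours of multichannel evaluation data collected

02. Recorded in a real mobile HRI scenario with a PR2 robot performing translational and azimuthal movements

03. 16 evaluation sets obtained by re-recording the clean set of Aurora 4 under different movement conditions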

Abstract

In real human-robot interaction (HRI) scenarios, speech recognition is a major challenge because of robot noise, background noise and a time-varying acoustic channel. This document describes the procedure used to obtain the Multichannel Robot Speech Recognition Database (MChRSR). It is composed of 12 hours of multichannel evaluation data recorded in a real mobile HRI scenario. The database was recorded with a PR2 robot performing different translational and azimuthal movements. Accordingly, 16 evaluation sets were obtained by re-recording the clean set of the Aurora 4 database in different movement conditions.

Taxonomy

Topics: Social Robot Interaction and HRI · Robotics and Automated Systems · IoT-based Smart Home Systems