ConferencingSpeech 2022 Challenge: Non-intrusive Objective Speech   Quality Assessment (NISQA) Challenge for Online Conferencing Applications

Gaoxiong Yi; Wei Xiao; Yiming Xiao; Babak Naderi; Sebastian M\"oller,; Wafaa Wardah; Gabriel Mittag; Ross Cutler; Zhuohuang Zhang; Donald S.; Williamson; Fei Chen; Fuzheng Yang; Shidong Shang

arXiv:2203.16032·cs.SD·April 4, 2022

ConferencingSpeech 2022 Challenge: Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge for Online Conferencing Applications

Gaoxiong Yi, Wei Xiao, Yiming Xiao, Babak Naderi, Sebastian M\"oller,, Wafaa Wardah, Gabriel Mittag, Ross Cutler, Zhuohuang Zhang, Donald S., Williamson, Fei Chen, Fuzheng Yang, Shidong Shang

PDF

Open Access

TL;DR

The paper presents the ConferencingSpeech 2022 challenge, which aims to develop non-intrusive deep learning models for objective speech quality assessment in online conferencing, supported by a large open dataset and comprehensive evaluation.

Contribution

It introduces a large-scale open dataset with subjective quality scores and benchmarks multiple models for non-intrusive speech quality assessment in conferencing scenarios.

Findings

01

Multiple models achieved competitive performance on the blind test set.

02

The challenge demonstrated the effectiveness of deep neural networks for objective speech quality prediction.

03

Open-sourcing the dataset facilitates further research in speech quality assessment.

Abstract

With the advances in speech communication systems such as online conferencing applications, we can seamlessly work with people regardless of where they are. However, during online meetings, speech quality can be significantly affected by background noise, reverberation, packet loss, network jitter, etc. Because of its nature, speech quality is traditionally assessed in subjective tests in laboratories and lately also in crowdsourcing following the international standards from ITU-T Rec. P.800 series. However, those approaches are costly and cannot be applied to customer data. Therefore, an effective objective assessment approach is needed to evaluate or monitor the speech quality of the ongoing conversation. The ConferencingSpeech 2022 challenge targets the non-intrusive deep neural network models for the speech quality assessment task. We open-sourced a training corpus with more than…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Speech Recognition and Synthesis