ConferencingSpeech 2022 Challenge: Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge for Online Conferencing Applications
Gaoxiong Yi, Wei Xiao, Yiming Xiao, Babak Naderi, Sebastian M\"oller,, Wafaa Wardah, Gabriel Mittag, Ross Cutler, Zhuohuang Zhang, Donald S., Williamson, Fei Chen, Fuzheng Yang, Shidong Shang

TL;DR
The paper presents the ConferencingSpeech 2022 challenge, which aims to develop non-intrusive deep learning models for objective speech quality assessment in online conferencing, supported by a large open dataset and comprehensive evaluation.
Contribution
It introduces a large-scale open dataset with subjective quality scores and benchmarks multiple models for non-intrusive speech quality assessment in conferencing scenarios.
Findings
Multiple models achieved competitive performance on the blind test set.
The challenge demonstrated the effectiveness of deep neural networks for objective speech quality prediction.
Open-sourcing the dataset facilitates further research in speech quality assessment.
Abstract
With the advances in speech communication systems such as online conferencing applications, we can seamlessly work with people regardless of where they are. However, during online meetings, speech quality can be significantly affected by background noise, reverberation, packet loss, network jitter, etc. Because of its nature, speech quality is traditionally assessed in subjective tests in laboratories and lately also in crowdsourcing following the international standards from ITU-T Rec. P.800 series. However, those approaches are costly and cannot be applied to customer data. Therefore, an effective objective assessment approach is needed to evaluate or monitor the speech quality of the ongoing conversation. The ConferencingSpeech 2022 challenge targets the non-intrusive deep neural network models for the speech quality assessment task. We open-sourced a training corpus with more than…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Speech Recognition and Synthesis
