An Open source Implementation of ITU-T Recommendation P.808 with   Validation

Babak Naderi; Ross Cutler

arXiv:2005.08138·eess.AS·November 5, 2020

An Open source Implementation of ITU-T Recommendation P.808 with Validation

Babak Naderi, Ross Cutler

PDF

1 Repo

TL;DR

This paper presents an open-source implementation of ITU-T P.808 for crowdsourced speech quality assessment, validated against laboratory results, with enhancements for speed and reliability in the testing process.

Contribution

The authors developed and validated an open-source, scalable implementation of ITU-T P.808 that includes new test methods and operational improvements for crowdsourcing speech quality evaluations.

Findings

01

MOS scores closely match laboratory results

02

Reproducibility of crowdsourced assessments is high

03

System enhancements improve reliability and efficiency

Abstract

The ITU-T Recommendation P.808 provides a crowdsourcing approach for conducting a subjective assessment of speech quality using the Absolute Category Rating (ACR) method. We provide an open-source implementation of the ITU-T Rec. P.808 that runs on the Amazon Mechanical Turk platform. We extended our implementation to include Degradation Category Ratings (DCR) and Comparison Category Ratings (CCR) test methods. We also significantly speed up the test process by integrating the participant qualification step into the main rating task compared to a two-stage qualification and rating solution. We provide program scripts for creating and executing the subjective test, and data cleansing and analyzing the answers to avoid operational errors. To validate the implementation, we compare the Mean Opinion Scores (MOS) collected through our implementation with MOS values from a standard laboratory…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

microsoft/P.808
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings