DNSMOS P.835: A Non-Intrusive Perceptual Objective Speech Quality Metric to Evaluate Noise Suppressors
Chandan K A Reddy, Vishak Gopal, Ross Cutler

TL;DR
This paper introduces DNSMOS P.835, a non-intrusive speech quality metric that predicts speech, noise, and overall quality scores aligned with human subjective evaluations, useful for evaluating noise suppression algorithms.
Contribution
It presents the first non-intrusive P.835-based speech quality metric trained on human ratings, providing accurate, three-dimensional quality assessments.
Findings
Highly correlated with human ratings (PCC=0.94 for speech, 0.98 for noise and overall)
First non-intrusive P.835 predictor available
Available as a public Azure service
Abstract
Human subjective evaluation is the gold standard to evaluate speech quality optimized for human perception. Perceptual objective metrics serve as a proxy for subjective scores. We have recently developed a non-intrusive speech quality metric called Deep Noise Suppression Mean Opinion Score (DNSMOS) using the scores from ITU-T Rec. P.808 subjective evaluation. The P.808 scores reflect the overall quality of the audio clip. ITU-T Rec. P.835 subjective evaluation framework gives the standalone quality scores of speech and background noise in addition to the overall quality. In this work, we train an objective metric based on P.835 human ratings that outputs 3 scores: i) speech quality (SIG), ii) background noise quality (BAK), and iii) the overall quality (OVRL) of the audio. The developed metric is highly correlated with human ratings, with a Pearson's Correlation Coefficient (PCC)=0.94…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Advanced Adaptive Filtering Techniques · Hearing Loss and Rehabilitation
Methodstravel james · Attentive Walk-Aggregating Graph Neural Network · Normalizing Flows · Sliced Iterative Generator
