Objective Measures of Perceptual Audio Quality Reviewed: An Evaluation   of Their Application Domain Dependence

Matteo Torcoli; Thorsten Kastner; J\"urgen Herre

arXiv:2110.11438·eess.AS·October 25, 2021

Objective Measures of Perceptual Audio Quality Reviewed: An Evaluation of Their Application Domain Dependence

Matteo Torcoli, Thorsten Kastner, J\"urgen Herre

PDF

TL;DR

This study evaluates how well various objective audio quality measures correlate with human perception across different domains, highlighting the importance of training data and auditory models for universal assessment.

Contribution

It introduces a new objective measure, SI-SA2f, and compares multiple existing measures across audio coding and source separation domains.

Findings

01

2f-model outperforms other measures in both domains

02

Training dataset and auditory model robustness are key for universal measures

03

Proposed SI-SA2f offers a new approach based on existing models

Abstract

Over the past few decades, computational methods have been developed to estimate perceptual audio quality. These methods, also referred to as objective quality measures, are usually developed and intended for a specific application domain. Because of their convenience, they are often used outside their original intended domain, even if it is unclear whether they provide reliable quality estimates in this case. This work studies the correlation of well-known state-of-the-art objective measures with human perceptual scores in two different domains: audio coding and source separation. The following objective measures are considered: fwSNRseg, dLLR, PESQ, PEAQ, POLQA, PEMO-Q, ViSQOLAudio, (SI-)BSSEval, PEASS, LKR-PI, 2f-model, and HAAQI. Additionally, a novel measure (SI-SA2f) is presented, based on the 2f-model and a BSSEval-based signal decomposition. We use perceptual scores from 7…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.