On The Effect Of Coding Artifacts On Acoustic Scene Classification

Nagashree K. S. Rao; Nils Peters

arXiv:2112.04841·eess.AS·December 10, 2021

On The Effect Of Coding Artifacts On Acoustic Scene Classification

Nagashree K. S. Rao, Nils Peters

PDF

Open Access

TL;DR

This paper investigates how perceptual audio coding affects acoustic scene classification accuracy, revealing significant degradation with compression and proposing training methods to mitigate this effect for resource-constrained devices.

Contribution

It demonstrates the impact of perceptual audio coding on classification performance and introduces training strategies to improve accuracy on compressed audio signals.

Findings

01

Classification accuracy degrades up to 57% with audio compression.

02

Lossy compression techniques during training improve accuracy on compressed audio.

03

Training with compressed audio enhances robustness across different codecs and bitrates.

Abstract

Previous DCASE challenges contributed to an increase in the performance of acoustic scene classification systems. State-of-the-art classifiers demand significant processing capabilities and memory which is challenging for resource-constrained mobile or IoT edge devices. Thus, it is more likely to deploy these models on more powerful hardware and classify audio recordings previously uploaded (or streamed) from low-power edge devices. In such scenario, the edge device may apply perceptual audio coding to reduce the transmission data rate. This paper explores the effect of perceptual audio coding on the classification performance using a DCASE 2020 challenge contribution [1]. We found that classification accuracy can degrade by up to 57% compared to classifying original (uncompressed) audio. We further demonstrate how lossy audio compression techniques during model training can improve…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Music and Audio Processing · Music Technology and Sound Studies