Deep Convolutional Neural Networks and Data Augmentation for   Environmental Sound Classification

Justin Salamon; Juan Pablo Bello

arXiv:1608.04363·cs.SD·April 5, 2017

Deep Convolutional Neural Networks and Data Augmentation for Environmental Sound Classification

Justin Salamon, Juan Pablo Bello

PDF

5 Repos

TL;DR

This paper introduces a deep CNN architecture for environmental sound classification and demonstrates that data augmentation significantly enhances performance, achieving state-of-the-art results by addressing data scarcity issues.

Contribution

The study proposes a novel deep CNN model combined with audio data augmentation techniques to improve environmental sound classification accuracy.

Findings

01

Data augmentation improves CNN performance significantly.

02

Deep CNN outperforms shallow models with augmentation.

03

Different augmentations affect class accuracy uniquely.

Abstract

The ability of deep convolutional neural networks (CNN) to learn discriminative spectro-temporal patterns makes them well suited to environmental sound classification. However, the relative scarcity of labeled data has impeded the exploitation of this family of high-capacity models. This study has two primary contributions: first, we propose a deep convolutional neural network architecture for environmental sound classification. Second, we propose the use of audio data augmentation for overcoming the problem of data scarcity and explore the influence of different augmentations on the performance of the proposed CNN architecture. Combined with data augmentation, the proposed model produces state-of-the-art results for environmental sound classification. We show that the improved performance stems from the combination of a deep, high-capacity model and an augmented training set: this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.