Effects of Dataset Sampling Rate for Noise Cancellation through Deep   Learning

Brandon Colelough; Andrew Zheng

arXiv:2405.20884·cs.SD·June 3, 2024

Effects of Dataset Sampling Rate for Noise Cancellation through Deep Learning

Brandon Colelough, Andrew Zheng

PDF

Open Access

TL;DR

This study investigates how different dataset sampling rates affect the performance of deep neural networks, specifically ConvTasNET, in noise cancellation tasks suitable for mobile devices, highlighting a trade-off between audio quality and processing time.

Contribution

It demonstrates that training ConvTasNET on higher sampling rate datasets improves noise cancellation quality but increases processing time, informing optimal sampling choices for mobile applications.

Findings

01

Higher sampling rates (48kHz) improve audio quality metrics.

02

Processing time increases with higher sampling rates.

03

ConvTasNET trained at 48kHz outperforms lower rates in noise suppression.

Abstract

Background: Active noise cancellation has been a subject of research for decades. Traditional techniques, like the Fast Fourier Transform, have limitations in certain scenarios. This research explores the use of deep neural networks (DNNs) as a superior alternative. Objective: The study aims to determine the effect sampling rate within training data has on lightweight, efficient DNNs that operate within the processing constraints of mobile devices. Methods: We chose the ConvTasNET network for its proven efficiency in speech separation and enhancement. ConvTasNET was trained on datasets such as WHAM!, LibriMix, and the MS-2023 DNS Challenge. The datasets were sampled at rates of 8kHz, 16kHz, and 48kHz to analyze the effect of sampling rate on noise cancellation efficiency and effectiveness. The model was tested on a core-i7 Intel processor from 2023, assessing the network's ability to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Image and Signal Denoising Methods

MethodsConvolutional time-domain audio separation network