The Effect of Training Dataset Size on Discriminative and Diffusion-Based Speech Enhancement Systems
Philippe Gonzalez, Zheng-Hua Tan, Jan {\O}stergaard, Jesper Jensen,, Tommy Sonne Alstr{\o}m, Tobias May

TL;DR
This study compares how training dataset size affects the performance of discriminative and diffusion-based speech enhancement models, revealing that diffusion models excel with small datasets but are outperformed by discriminative models on larger datasets.
Contribution
It systematically evaluates the impact of dataset size on both diffusion-based and discriminative speech enhancement systems under controlled data diversity conditions.
Findings
Diffusion models perform best with datasets of 10 hours or less.
Discriminative models improve more with larger datasets, surpassing diffusion models at 100 hours.
Diffusion models' performance plateaus regardless of increased dataset size.
Abstract
The performance of deep neural network-based speech enhancement systems typically increases with the training dataset size. However, studies that investigated the effect of training dataset size on speech enhancement performance did not consider recent approaches, such as diffusion-based generative models. Diffusion models are typically trained with massive datasets for image generation tasks, but whether this is also required for speech enhancement is unknown. Moreover, studies that investigated the effect of training dataset size did not control for the data diversity. It is thus unclear whether the performance improvement was due to the increased dataset size or diversity. Therefore, we systematically investigate the effect of training dataset size on the performance of popular state-of-the-art discriminative and diffusion-based speech enhancement systems in matched conditions. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSparse Evolutionary Training · Diffusion
