BigWavGAN: A Wave-To-Wave Generative Adversarial Network for Music   Super-Resolution

Yenan Zhang; Hiroshi Watanabe

arXiv:2308.06483·cs.SD·October 31, 2023

BigWavGAN: A Wave-To-Wave Generative Adversarial Network for Music Super-Resolution

Yenan Zhang, Hiroshi Watanabe

PDF

Open Access

TL;DR

BigWavGAN is a novel wave-to-wave GAN model that significantly improves music super-resolution quality by integrating large-scale models with advanced discriminators and adversarial training, outperforming existing methods.

Contribution

The paper introduces BigWavGAN, combining Demucs with multi-scale and multi-resolution discriminators, enhancing music super-resolution beyond current state-of-the-art models.

Findings

01

Outperforms SOTA in simulated scenarios

02

Generates high perceptual quality music

03

Shows superior generalization to out-of-distribution data

Abstract

Generally, Deep Neural Networks (DNNs) are expected to have high performance when their model size is large. However, large models failed to produce high-quality results commensurate with their scale in music Super-Resolution (SR). We attribute this to that DNNs cannot learn information commensurate with their size from standard mean square error losses. To unleash the potential of large DNN models in music SR, we propose BigWavGAN, which incorporates Demucs, a large-scale wave-to-wave model, with State-Of-The-Art (SOTA) discriminators and adversarial training strategies. Our discriminator consists of Multi-Scale Discriminator (MSD) and Multi-Resolution Discriminator (MRD). During inference, since only the generator is utilized, there are no additional parameters or computational resources required compared to the baseline model Demucs. Objective evaluation affirms the effectiveness of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques · Image and Signal Denoising Methods · Image Processing Techniques and Applications