An investigation of pre-upsampling generative modelling and Generative Adversarial Networks in audio super resolution
James King, Ramon Vi\~nas Torn\'e, Alexander Campbell, Pietro Li\`o

TL;DR
This paper compares pre-upsampling generative models and GAN-based approaches for audio super-resolution, demonstrating a new model that outperforms existing methods in quality and efficiency.
Contribution
It introduces AudioEDSR, a novel pre-upsampling generative model inspired by image super-resolution, and explores integrating it with Wasserstein GANs for improved audio super-resolution.
Findings
AudioEDSR outperforms AudioUNet with 20% lower spectral distance.
AudioEDSR achieves a mean opinion score of 4.06, higher than 3.82.
AudioEDSR has 87% fewer parameters than AudioUNet.
Abstract
There have been several successful deep learning models that perform audio super-resolution. Many of these approaches involve using preprocessed feature extraction which requires a lot of domain-specific signal processing knowledge to implement. Convolutional Neural Networks (CNNs) improved upon this framework by automatically learning filters. An example of a convolutional approach is AudioUNet, which takes inspiration from novel methods of upsampling images. Our paper compares the pre-upsampling AudioUNet to a new generative model that upsamples the signal before using deep learning to transform it into a more believable signal. Based on the EDSR network for image super-resolution, the newly proposed model outperforms UNet with a 20% increase in log spectral distance and a mean opinion score of 4.06 compared to 3.82 for the two times upsampling case. AudioEDSR also has 87% fewer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Image and Signal Denoising Methods · Digital Media Forensic Detection
