Anti-aliasing of neural distortion effects via model fine tuning
Alistair Carson, Alec Wright, Stefan Bilbao

TL;DR
This paper introduces a fine-tuning method for neural guitar distortion models that significantly reduces aliasing artifacts caused by high-frequency inputs, improving perceptual quality without extensive oversampling.
Contribution
A novel teacher-student fine-tuning approach that suppresses aliasing in neural distortion models using aliasing-free training data, applicable to LSTM and TCN architectures.
Findings
Aliasing was reduced more than twofold oversampling in most cases.
Harmonic distortion components are affected, with model-dependent effects.
LSTM models achieved the best balance between anti-aliasing and similarity to analog devices.
Abstract
Neural networks have become ubiquitous with guitar distortion effects modelling in recent years. Despite their ability to yield perceptually convincing models, they are susceptible to frequency aliasing when driven by high frequency and high gain inputs. Nonlinear activation functions create both the desired harmonic distortion and unwanted aliasing distortion as the bandwidth of the signal is expanded beyond the Nyquist frequency. Here, we present a method for reducing aliasing in neural models via a teacher-student fine tuning approach, where the teacher is a pre-trained model with its weights frozen, and the student is a copy of this with learnable parameters. The student is fine-tuned against an aliasing-free dataset generated by passing sinusoids through the original model and removing non-harmonic components from the output spectra. Our results show that this method significantly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Reservoir Computing · Music Technology and Sound Studies · Speech and Audio Processing
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory
