TL;DR
SerumRNN is an educational tool that guides users step-by-step in applying audio effects to transform input sounds into desired outputs, improving efficiency and learning in complex VST synthesizer programming.
Contribution
It introduces a novel iterative system that provides effect sequencing instructions for complex VST synthesizers, outperforming baseline methods.
Findings
SerumRNN offers consistent and useful feedback across various effects.
The system learns to prioritize effects effectively.
It discovers more efficient effect sequences than baseline approaches.
Abstract
Learning to program an audio production VST synthesizer is a time consuming process, usually obtained through inefficient trial and error and only mastered after years of experience. As an educational and creative tool for sound designers, we propose SerumRNN: a system that provides step-by-step instructions for applying audio effects to change a user's input audio towards a desired sound. We apply our system to Xfer Records Serum: currently one of the most popular and complex VST synthesizers used by the audio production community. Our results indicate that SerumRNN is consistently able to provide useful feedback for a variety of different audio effects and synthesizer presets. We demonstrate the benefits of using an iterative system and show that SerumRNN learns to prioritize effects and can discover more efficient effect order sequences than a variety of baselines.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
