NAS-FM: Neural Architecture Search for Tunable and Interpretable Sound   Synthesis based on Frequency Modulation

Zhen Ye; Wei Xue; Xu Tan; Qifeng Liu; Yike Guo

arXiv:2305.12868·cs.SD·May 23, 2023·1 cites

NAS-FM: Neural Architecture Search for Tunable and Interpretable Sound Synthesis based on Frequency Modulation

Zhen Ye, Wei Xue, Xu Tan, Qifeng Liu, Yike Guo

PDF

Open Access

TL;DR

This paper introduces NAS-FM, a neural architecture search method that automatically designs tunable and interpretable frequency modulation synthesizers, outperforming handcrafted counterparts without requiring expert knowledge.

Contribution

It presents a novel NAS approach for automatic FM synthesizer design, enabling flexible, interpretable sound synthesis without prior expertise.

Findings

01

Automatically builds effective synthesizers for diverse sounds

02

Outperforms handcrafted synthesizers in quality

03

Reduces manual design and tuning effort

Abstract

Developing digital sound synthesizers is crucial to the music industry as it provides a low-cost way to produce high-quality sounds with rich timbres. Existing traditional synthesizers often require substantial expertise to determine the overall framework of a synthesizer and the parameters of submodules. Since expert knowledge is hard to acquire, it hinders the flexibility to quickly design and tune digital synthesizers for diverse sounds. In this paper, we propose ``NAS-FM'', which adopts neural architecture search (NAS) to build a differentiable frequency modulation (FM) synthesizer. Tunable synthesizers with interpretable controls can be developed automatically from sounds without any prior expert knowledge and manual operating costs. In detail, we train a supernet with a specifically designed search space, including predicting the envelopes of carriers and modulators with different…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Speech and Audio Processing · Music Technology and Sound Studies