Audio synthesizer inversion in symmetric parameter spaces with approximately equivariant flow matching

Ben Hayes; Charalampos Saitis; Gy\"orgy Fazekas

arXiv:2506.07199·cs.SD·June 10, 2025

Audio synthesizer inversion in symmetric parameter spaces with approximately equivariant flow matching

Ben Hayes, Charalampos Saitis, Gy\"orgy Fazekas

PDF

Open Access 1 Repo

TL;DR

This paper addresses the challenge of inverting audio synthesizers by leveraging symmetry-aware probabilistic models, notably permutation equivariant flows, to improve parameter recovery in complex, real-world synthesizers.

Contribution

It introduces a symmetry-aware generative modeling approach, including a relaxed equivariance strategy, to better invert synthesizers considering their intrinsic symmetries.

Findings

01

Permutation-invariant regression degrades performance.

02

Conditional generative models improve inversion accuracy.

03

Permutation equivariant flows outperform baselines on real synthesizer.

Abstract

Many audio synthesizers can produce the same signal given different parameter configurations, meaning the inversion from sound to parameters is an inherently ill-posed problem. We show that this is largely due to intrinsic symmetries of the synthesizer, and focus in particular on permutation invariance. First, we demonstrate on a synthetic task that regressing point estimates under permutation symmetry degrades performance, even when using a permutation-invariant loss function or symmetry-breaking heuristics. Then, viewing equivalent solutions as modes of a probability distribution, we show that a conditional generative model substantially improves performance. Further, acknowledging the invariance of the implicit parameter distribution, we find that performance is further improved by using a permutation equivariant continuous normalizing flow. To accommodate intricate symmetries in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ben-hayes/synth-permutations
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic Technology and Sound Studies · Music and Audio Processing · Speech and Audio Processing

MethodsFocus