CycleGAN Voice Conversion of Spectral Envelopes using Adversarial Weights
Rafael Ferro, Nicolas Obin, Axel Roebel

TL;DR
This paper introduces spectral envelope-based voice conversion using CycleGAN with novel adversarial weight training methods, improving training stability and achieving state-of-the-art results with reduced network complexity.
Contribution
It proposes two new adversarial weight training paradigms for CycleGAN, enhancing training stability and efficiency in voice conversion tasks.
Findings
Energy-constrained CycleGAN improves conversion quality.
Proposed methods outperform previous weighted GAN approaches.
Achieves state-of-the-art results with smaller networks.
Abstract
This paper tackles GAN optimization and stability issues in the context of voice conversion. First, to simplify the conversion task, we propose to use spectral envelopes as inputs. Second we propose two adversarial weight training paradigms, the generalized weighted GAN and the generator impact GAN, both aim at reducing the impact of the generator on the discriminator, so both can learn more gradually and efficiently during training. Applying an energy constraint to the cycleGAN paradigm considerably improved conversion quality. A subjective experiment conducted on a voice conversion task on the voice conversion challenge 2018 dataset shows first that despite a significantly reduced network complexity, the proposed method achieves state-of-the-art results, and second that the proposed weighted GAN methods outperform a previously proposed one.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsBatch Normalization · Residual Connection · PatchGAN · *Communicated@Fast*How Do I Communicate to Expedia? · Tanh Activation · Residual Block · Instance Normalization · HuMan(Expedia)||How do I get a human at Expedia? · Sigmoid Activation · GAN Least Squares Loss
