Improving Adversarial Waveform Generation based Singing Voice Conversion with Harmonic Signals
Haohan Guo, Zhiping Zhou, Fanbo Meng, Kai Liu

TL;DR
This paper introduces a method to improve singing voice conversion by incorporating harmonic signals, extracted from pitch and filtered with neural networks, to enhance audio quality and harmonic continuity in GAN-based models.
Contribution
It proposes feeding harmonic signals into GAN-based SVC models, improving stability, harmonic smoothness, and audio fidelity over traditional methods.
Findings
Significant improvement in MOS scores for fidelity and timbre.
Enhanced harmonic smoothness and continuity in generated audio.
Filtered excitation better matches target audio characteristics.
Abstract
Adversarial waveform generation has been a popular approach as the backend of singing voice conversion (SVC) to generate high-quality singing audio. However, the instability of GAN also leads to other problems, such as pitch jitters and U/V errors. It affects the smoothness and continuity of harmonics, hence degrades the conversion quality seriously. This paper proposes to feed harmonic signals to the SVC model in advance to enhance audio generation. We extract the sine excitation from the pitch, and filter it with a linear time-varying (LTV) filter estimated by a neural network. Both these two harmonic signals are adopted as the inputs to generate the singing waveform. In our experiments, two mainstream models, MelGAN and ParallelWaveGAN, are investigated to validate the effectiveness of the proposed approach. We conduct a MOS test on clean and noisy test sets. The result shows that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Model Reduction and Neural Networks · Music and Audio Processing
