DPN-GAN: Inducing Periodic Activations in Generative Adversarial Networks for High-Fidelity Audio Synthesis
Zeeshan Ahmad, Shudi Bao, Meng Chen

TL;DR
DPN-GAN introduces a novel GAN architecture with periodic activations and deformable convolutions, significantly improving high-fidelity audio synthesis and robustness across diverse datasets.
Contribution
The paper proposes DPN-GAN, a new GAN model that uses periodic ReLU activations and deformable convolutions to enhance audio quality and diversity.
Findings
Outperforms state-of-the-art GANs on multiple audio datasets.
Produces higher fidelity and more robust audio synthesis.
Effective in both speech and music generation tasks.
Abstract
In recent years, generative adversarial networks (GANs) have made significant progress in generating audio sequences. However, these models typically rely on bandwidth-limited mel-spectrograms, which constrain the resolution of generated audio sequences, and lead to mode collapse during conditional generation. To address this issue, we propose Deformable Periodic Network based GAN (DPN-GAN), a novel GAN architecture that incorporates a kernel-based periodic ReLU activation function to induce periodic bias in audio generation. This innovative approach enhances the model's ability to capture and reproduce intricate audio patterns. In particular, our proposed model features a DPN module for multi-resolution generation utilizing deformable convolution operations, allowing for adaptive receptive fields that improve the quality and fidelity of the synthetic audio. Additionally, we enhance the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methods*Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Average Pooling · Residual Connection · Concatenated Skip Connection · Softmax · Grouped Convolution · DPN Block · Batch Normalization · Global Average Pooling
