Sinusoidal wave generating network based on adversarial learning and its   application: synthesizing frog sounds for data augmentation

Sangwook Park; David K. Han; and Hanseok Ko

arXiv:1901.02050·cs.SD·January 9, 2019

Sinusoidal wave generating network based on adversarial learning and its application: synthesizing frog sounds for data augmentation

Sangwook Park, David K. Han, and Hanseok Ko

PDF

Open Access

TL;DR

This paper introduces a sinusoidal wave generative network based on adversarial learning, capable of synthesizing realistic frog sounds for data augmentation, which improves signal processing and classification tasks.

Contribution

The paper presents a novel adversarial learning-based sinusoidal wave generator specifically designed for audio synthesis and data augmentation in bioacoustic applications.

Findings

01

The proposed network produces realistic sinusoidal wave signals.

02

Synthetic sounds enhance amphibian sound classification accuracy.

03

The model outperforms traditional methods in generating audio signals.

Abstract

Simulators that generate observations based on theoretical models can be important tools for development, prediction, and assessment of signal processing algorithms. In order to design these simulators, painstaking effort is required to construct mathematical models according to their application. Complex models are sometimes necessary to represent a variety of real phenomena. In contrast, obtaining synthetic observations from generative models developed from real observations often require much less effort. This paper proposes a generative model based on adversarial learning. Given that observations are typically signals composed of a linear combination of sinusoidal waves and random noises, sinusoidal wave generating networks are first designed based on an adversarial network. Audio waveform generation can then be performed using the proposed network. Several approaches to designing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Model Reduction and Neural Networks · Speech and Audio Processing