The Scattering Transform Network with Generalized Morse Wavelets and Its Application to Music Genre Classification
Wai Ho Chak, Naoki Saito, David Weber

TL;DR
This paper introduces the GMW-STN, a novel scattering transform network utilizing Generalized Morse Wavelets, which enhances music genre classification by providing better interpretability and improved performance over traditional methods.
Contribution
The paper proposes replacing Morlet wavelets with Generalized Morse Wavelets in the scattering transform network, improving interpretability and classification accuracy for nonstationary signals like music.
Findings
GMW-STN outperforms conventional STN in music genre classification.
Increasing layers to three improves GMW-STN performance.
GMWs provide better multiscale amplitude and phase information.
Abstract
We propose to use the Generalized Morse Wavelets (GMWs) instead of commonly-used Morlet (or Gabor) wavelets in the Scattering Transform Network (STN), which we call the GMW-STN, for signal classification problems. The GMWs form a parameterized family of truly analytic wavelets while the Morlet wavelets are only approximately analytic. The analyticity of underlying wavelet filters in the STN is particularly important for nonstationary oscillatory signals such as music signals because it improves interpretability of the STN representations by providing multiscale amplitude and phase (and consequently frequency) information of input signals. We demonstrate the superiority of the GMW-STN over the conventional STN in music genre classification using the so-called GTZAN database. Moreover, we show the performance improvement of the GMW-STN by increasing its number of layers to three over the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Signal Denoising Methods · Music and Audio Processing · Ultrasonics and Acoustic Wave Propagation
