Music Genre Classification Using Spectral Analysis and Sparse Representation of the Signals
Mehdi Banitalebi-Dehkordi, Amin Banitalebi-Dehkordi

TL;DR
This paper introduces a robust music genre classification approach combining spectral analysis and sparse representation, achieving high accuracy and efficiency on the GTZAN dataset.
Contribution
It presents a novel feature extraction method using sparse FFT and spectral analysis, improving classification accuracy and computational efficiency over existing SRC approaches.
Findings
Outperforms state-of-the-art SRC methods on GTZAN dataset
Reduces signal dimensionality significantly
Offers better computational efficiency than CS-based classifiers
Abstract
In this paper, we proposed a robust music genre classification method based on a sparse FFT based feature extraction method which extracted with discriminating power of spectral analysis of non-stationary audio signals, and the capability of sparse representation based classifiers. Feature extraction method combines two sets of features namely short-term features (extracted from windowed signals) and long-term features (extracted from combination of extracted short-time features). Experimental results demonstrate that the proposed feature extraction method leads to a sparse representation of audio signals. As a result, a significant reduction in the dimensionality of the signals is achieved. The extracted features are then fed into a sparse representation based classifier (SRC). Our experimental results on the GTZAN database demonstrate that the proposed method outperforms the other…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Speech and Audio Processing · Blind Source Separation Techniques
