Environmental Sounds Spectrogram Classification using Log-Gabor Filters   and Multiclass Support Vector Machines

Sameh Souli; Zied Lachiri

arXiv:1209.5756·cs.CV·September 27, 2012·2 cites

Environmental Sounds Spectrogram Classification using Log-Gabor Filters and Multiclass Support Vector Machines

Sameh Souli, Zied Lachiri

PDF

Open Access

TL;DR

This paper introduces novel feature extraction techniques for environmental sound spectrogram classification using log-Gabor filters and multiclass SVMs, demonstrating improved efficiency over existing methods.

Contribution

It proposes three new methods for spectrogram feature extraction with log-Gabor filters, identifying the most effective approach for environmental sound classification.

Findings

01

Second method outperforms others in classification accuracy

02

Spectrogram segmentation enhances feature extraction

03

Log-Gabor filter bank improves feature robustness

Abstract

This paper presents novel approaches for efficient feature extraction using environmental sound magnitude spectrogram. We propose approach based on the visual domain. This approach included three methods. The first method is based on extraction for each spectrogram a single log-Gabor filter followed by mutual information procedure. In the second method, the spectrogram is passed by the same steps of the first method but with an averaged bank of 12 log-Gabor filter. The third method consists of spectrogram segmentation into three patches, and after that for each spectrogram patch we applied the second method. The classification results prove that the second method is the most efficient in our environmental sound classification system.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Speech and Audio Processing · Blind Source Separation Techniques