Acoustic scene classification using auditory datasets

Jayesh Kumpawat; Shubhajit Dey

arXiv:2112.13450·cs.SD·July 18, 2022·1 cites

Acoustic scene classification using auditory datasets

Jayesh Kumpawat, Shubhajit Dey

PDF

Open Access 1 Repo

TL;DR

This paper explores advanced mathematical and machine learning techniques for acoustic scene classification, utilizing data augmentation and spectrogram optimization to improve audio analysis and open new research avenues.

Contribution

It introduces novel problem-specific methods, including data augmentation and spectrogram optimization, enhancing acoustic scene classification performance and expanding research possibilities.

Findings

01

Improved accuracy with data augmentation techniques

02

Effective spectrogram optimization strategies

03

Potential for cross-domain audio analysis applications

Abstract

The approach used not only challenges some of the fundamental mathematical techniques used so far in early experiments of the same trend but also introduces new scopes and new horizons for interesting results. The physics governing spectrograms have been optimized in the project along with exploring how it handles the intense requirements of the problem at hand. Major contributions and developments brought under the light, through this project involve using better mathematical techniques and problem-specific machine learning methods. Improvised data analysis and data augmentation for audio datasets like frequency masking and random frequency-time stretching are used in the project and hence are explained in this paper. In the used methodology, the audio transforms principle were also tried and explored, and indeed the insights gained were used constructively in the later stages of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Shubhajit412/ACOUSTIC_SCENE_CLASSIFICATION_USING_AUDITORY_DATASETS
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Music and Audio Processing