Acoustic scene classification using auditory datasets
Jayesh Kumpawat, Shubhajit Dey

TL;DR
This paper explores advanced mathematical and machine learning techniques for acoustic scene classification, utilizing data augmentation and spectrogram optimization to improve audio analysis and open new research avenues.
Contribution
It introduces novel problem-specific methods, including data augmentation and spectrogram optimization, enhancing acoustic scene classification performance and expanding research possibilities.
Findings
Improved accuracy with data augmentation techniques
Effective spectrogram optimization strategies
Potential for cross-domain audio analysis applications
Abstract
The approach used not only challenges some of the fundamental mathematical techniques used so far in early experiments of the same trend but also introduces new scopes and new horizons for interesting results. The physics governing spectrograms have been optimized in the project along with exploring how it handles the intense requirements of the problem at hand. Major contributions and developments brought under the light, through this project involve using better mathematical techniques and problem-specific machine learning methods. Improvised data analysis and data augmentation for audio datasets like frequency masking and random frequency-time stretching are used in the project and hence are explained in this paper. In the used methodology, the audio transforms principle were also tried and explored, and indeed the insights gained were used constructively in the later stages of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Music and Audio Processing
