A punishment voting algorithm based on super categories construction for acoustic scene classification
Weiping Zheng, Zhenyao Mo, Jiantao Yi

TL;DR
This paper introduces a punishment voting algorithm leveraging super categories constructed via spectral clustering, which enhances acoustic scene classification accuracy by combining DenseNet-like models and ensemble strategies.
Contribution
The paper presents a novel punishment voting method based on super categories, improving ensemble performance in acoustic scene classification.
Findings
Significant performance improvement on DCASE2017 dataset
Effective use of spectral clustering for super categories
Enhanced ensemble accuracy with punishment voting
Abstract
In acoustic scene classification researches, audio segment is usually split into multiple samples. Majority voting is then utilized to ensemble the results of the samples. In this paper, we propose a punishment voting algorithm based on the super categories construction method for acoustic scene classification. Specifically, we propose a DenseNet-like model as the base classifier. The base classifier is trained by the CQT spectrograms generated from the raw audio segments. Taking advantage of the results of the base classifier, we propose a super categories construction method using the spectral clustering. Super classifiers corresponding to the constructed super categories are further trained. Finally, the super classifiers are utilized to enhance the majority voting of the base classifier by punishment voting. Experiments show that the punishment voting obviously improves the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Speech and Audio Processing · Speech Recognition and Synthesis
