Feature Pyramid Attention based Residual Neural Network for Environmental Sound Classification
Liguang Zhou, Yuhongze Zhou, Xiaonan Qi, Junjie Hu, Tin Lun Lam,, Yangsheng Xu

TL;DR
This paper introduces a feature pyramid attention network (FPAM) that enhances environmental sound classification by focusing on semantically relevant features across multiple scales, improving accuracy over baseline methods.
Contribution
The paper proposes a novel end-to-end FPAM framework that localizes important frequency and temporal features in spectrograms for better sound classification.
Findings
FPAM achieves comparable performance to state-of-the-art methods.
Significant performance improvements over baseline models.
Attention maps focus on semantically relevant regions in spectrograms.
Abstract
Environmental sound classification (ESC) is a challenging problem due to the unstructured spatial-temporal relations that exist in the sound signals. Recently, many studies have focused on abstracting features from convolutional neural networks while the learning of semantically relevant frames of sound signals has been overlooked. To this end, we present an end-to-end framework, namely feature pyramid attention network (FPAM), focusing on abstracting the semantically relevant features for ESC. We first extract the feature maps of the preprocessed spectrogram of the sound waveform by a backbone network. Then, to build multi-scale hierarchical features of sound spectrograms, we construct a feature pyramid representation of the sound spectrograms by aggregating the feature maps from multi-scale layers, where the temporal frames and spatial locations of semantically relevant frames are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Noise Effects and Management · Diverse Musicological Studies
