Deep Space Separable Distillation for Lightweight Acoustic Scene Classification

ShuQi Ye; Yuan Tian

arXiv:2405.03567·cs.SD·May 7, 2024

Open Access

TL;DR

This paper introduces a lightweight deep space separable distillation network for acoustic scene classification that reduces computational complexity and improves performance using novel operators and frequency decomposition.

Contribution

The paper proposes a deep space separable distillation network that combines high-low frequency decomposition of the log-mel spectrogram with three lightweight operators (SC, OSC, and SPC) designed for acoustic scene classification.

Findings

01. Achieves 9.8% performance gain over existing methods

02. Reduces model parameters and computational complexity

03. Maintains high accuracy with lightweight design

Abstract

Acoustic scene classification (ASC) is highly important in the real world. Recently, deep learning-based methods have been widely employed for acoustic scene classification. However, these methods are currently not lightweight enough, and their performance is not satisfactory. To solve these problems, we propose a deep space separable distillation network. First, the network performs high-low frequency decomposition on the log-mel spectrogram, significantly reducing computational complexity while maintaining model performance. Second, we design three lightweight operators specifically for ASC: Separable Convolution (SC), Orthonormal Separable Convolution (OSC), and Separable Partial Convolution (SPC). These operators exhibit highly efficient feature extraction capabilities in acoustic scene classification tasks. The experimental results demonstrate that the proposed…
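To give a rough sense of why separable operators and a frequency split can cut cost, the sketch below compares the weight count of a standard convolution with that of a generic depthwise separable convolution, and splits a spectrogram into a low and a high band along the mel axis. This is only an illustration under stated assumptions: the function names, channel sizes, and split point are hypothetical, and the code does not reproduce the paper's SC, OSC, or SPC operators or its actual decomposition scheme.

```python
def standard_conv_params(c_in, c_out, k):
    """Weight count of a standard k x k convolution (bias ignored)."""
    return c_in * c_out * k * k


def depthwise_separable_params(c_in, c_out, k):
    """Weight count of a k x k depthwise conv followed by a 1 x 1 pointwise conv.

    This is the generic depthwise separable factorization, used here as a
    stand-in; the paper's operators may differ.
    """
    return c_in * k * k + c_in * c_out


def split_low_high(spectrogram, split_bin):
    """Split a [mel_bins][frames] nested list into low and high frequency bands.

    The split point is an assumption chosen by the caller.
    """
    return spectrogram[:split_bin], spectrogram[split_bin:]


if __name__ == "__main__":
    # Hypothetical layer sizes, chosen only for illustration.
    c_in, c_out, k = 64, 64, 3
    std = standard_conv_params(c_in, c_out, k)
    sep = depthwise_separable_params(c_in, c_out, k)
    print(f"standard: {std} weights, separable: {sep} weights")

    # Toy spectrogram with 8 mel bins and 4 frames, split at the midpoint.
    toy = [[float(b)] * 4 for b in range(8)]
    low, high = split_low_high(toy, split_bin=4)
    print(f"low band bins: {len(low)}, high band bins: {len(high)}")
```

Each band can then be processed by a smaller branch, which is one plausible way a frequency decomposition reduces per-branch computation.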

Taxonomy

Topics: Water Systems and Optimization · Speech and Audio Processing · Advanced Chemical Sensor Technologies

Methods: Convolution