Advanced Framework for Animal Sound Classification With Features Optimization
Qiang Yang, Xiuying Chen, Changsheng Ma, Carlos M. Duarte, and, Xiangliang Zhang

TL;DR
This paper introduces an optimized feature extraction and an attention-based Bi-LSTM framework for animal sound classification, significantly improving accuracy over existing methods in diverse and noisy environments.
Contribution
It presents a novel feature optimization process combined with an attention-based Bi-LSTM model tailored for animal sound classification, along with a new benchmark dataset.
Findings
Outperforms baseline methods by over 25% in key metrics
Effective feature optimization enhances model performance
Benchmark dataset includes oceanic animals and birds
Abstract
The automatic classification of animal sounds presents an enduring challenge in bioacoustics, owing to the diverse statistical properties of sound signals, variations in recording equipment, and prevalent low Signal-to-Noise Ratio (SNR) conditions. Deep learning models like Convolutional Neural Networks (CNN) and Long Short-Term Memory (LSTM) have excelled in human speech recognition but have not been effectively tailored to the intricate nature of animal sounds, which exhibit substantial diversity even within the same domain. We propose an automated classification framework applicable to general animal sound classification. Our approach first optimizes audio features from Mel-frequency cepstral coefficients (MFCC) including feature rearrangement and feature reduction. It then uses the optimized features for the deep learning model, i.e., an attention-based Bidirectional LSTM (Bi-LSTM),…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Animal Vocal Communication and Behavior · Food Supply Chain Traceability
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory
