Joint Scattering for Automatic Chick Call Recognition
Changhong Wang, Emmanouil Benetos, Shuge Wang, Elisabetta Versace

TL;DR
This paper introduces an automatic chick call recognition system using joint time-frequency scattering transform features, significantly improving classification accuracy over traditional methods in laboratory-recorded datasets.
Contribution
The paper presents a novel chick call recognition system leveraging JTFS features, enhancing accuracy compared to baseline methods.
Findings
JTFS features improve macro F-measure by 9.5% and 11.7%.
System effectively classifies chick calls in laboratory conditions.
Proposed method outperforms mel-frequency cepstral coefficients baseline.
Abstract
Animal vocalisations contain important information about health, emotional state, and behaviour, thus can be potentially used for animal welfare monitoring. Motivated by the spectro-temporal patterns of chick calls in the timefrequency domain, in this paper we propose an automatic system for chick call recognition using the joint timefrequency scattering transform (JTFS). Taking full-length recordings as input, the system first extracts chick call candidates by an onset detector and silence removal. After computing their JTFS features, a support vector machine classifier groups each candidate into different chick call types. Evaluating on a dataset comprising 3013 chick calls collected in laboratory conditions, the proposed recognition system using the JTFS features improves the frame- and event-based macro F-measures by 9.5% and 11.7%, respectively, than that of a mel-frequency…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnimal Vocal Communication and Behavior · Animal Behavior and Welfare Studies · Music and Audio Processing
