ECHO: Environmental Sound Classification with Hierarchical Ontology-guided Semi-Supervised Learning
Pranav Gupta, Raunak Sharma, Rashmi Kumari, Sri Krishna Aditya,, Shwetank Choudhary, Sumit Kumar, Kanchana M, Thilagavathy R

TL;DR
ECHO introduces a semi-supervised learning framework for environmental sound classification that leverages hierarchical label ontology and large language models to improve accuracy across multiple datasets.
Contribution
The paper presents a novel semi-supervised approach using ontology-guided pretext tasks and LLMs to enhance sound classification performance.
Findings
Achieves 1-8% accuracy improvement over baselines
Utilizes hierarchical label ontology for semantic learning
Effective across UrbanSound8K, ESC-10, and ESC-50 datasets
Abstract
Environment Sound Classification has been a well-studied research problem in the field of signal processing and up till now more focus has been laid on fully supervised approaches. Over the last few years, focus has moved towards semi-supervised methods which concentrate on the utilization of unlabeled data, and self-supervised methods which learn the intermediate representation through pretext task or contrastive learning. However, both approaches require a vast amount of unlabelled data to improve performance. In this work, we propose a novel framework called Environmental Sound Classification with Hierarchical Ontology-guided semi-supervised Learning (ECHO) that utilizes label ontology-based hierarchy to learn semantic representation by defining a novel pretext task. In the pretext task, the model tries to predict coarse labels defined by the Large Language Model (LLM) based on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsFocus
