ECHO: Environmental Sound Classification with Hierarchical   Ontology-guided Semi-Supervised Learning

Pranav Gupta; Raunak Sharma; Rashmi Kumari; Sri Krishna Aditya,; Shwetank Choudhary; Sumit Kumar; Kanchana M; Thilagavathy R

arXiv:2409.14043·cs.SD·September 24, 2024

ECHO: Environmental Sound Classification with Hierarchical Ontology-guided Semi-Supervised Learning

Pranav Gupta, Raunak Sharma, Rashmi Kumari, Sri Krishna Aditya,, Shwetank Choudhary, Sumit Kumar, Kanchana M, Thilagavathy R

PDF

TL;DR

ECHO introduces a semi-supervised learning framework for environmental sound classification that leverages hierarchical label ontology and large language models to improve accuracy across multiple datasets.

Contribution

The paper presents a novel semi-supervised approach using ontology-guided pretext tasks and LLMs to enhance sound classification performance.

Findings

01

Achieves 1-8% accuracy improvement over baselines

02

Utilizes hierarchical label ontology for semantic learning

03

Effective across UrbanSound8K, ESC-10, and ESC-50 datasets

Abstract

Environment Sound Classification has been a well-studied research problem in the field of signal processing and up till now more focus has been laid on fully supervised approaches. Over the last few years, focus has moved towards semi-supervised methods which concentrate on the utilization of unlabeled data, and self-supervised methods which learn the intermediate representation through pretext task or contrastive learning. However, both approaches require a vast amount of unlabelled data to improve performance. In this work, we propose a novel framework called Environmental Sound Classification with Hierarchical Ontology-guided semi-supervised Learning (ECHO) that utilizes label ontology-based hierarchy to learn semantic representation by defining a novel pretext task. In the pretext task, the model tries to predict coarse labels defined by the Large Language Model (LLM) based on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsFocus