Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic   Scene Classification under Domain Shift

Jisheng Bai; Mou Wang; Haohe Liu; Han Yin; Yafei Jia; Siwei Huang,; Yutong Du; Dongzhe Zhang; Dongyuan Shi; Woon-Seng Gan; Mark D. Plumbley,; Susanto Rahardja; Bin Xiang; Jianfeng Chen

arXiv:2402.02694·eess.AS·March 1, 2024·5 cites

Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift

Jisheng Bai, Mou Wang, Haohe Liu, Han Yin, Yafei Jia, Siwei Huang,, Yutong Du, Dongzhe Zhang, Dongyuan Shi, Woon-Seng Gan, Mark D. Plumbley,, Susanto Rahardja, Bin Xiang, Jianfeng Chen

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new challenge for semi-supervised acoustic scene classification that addresses domain shift issues across different regions and encourages innovative semi-supervised learning methods to improve model robustness.

Contribution

It presents the ICME 2024 Grand Challenge focusing on semi-supervised learning for ASC under domain shift, highlighting the need to utilize unlabeled data and address geographical discrepancies.

Findings

01

Progress in device generalization for ASC

02

Recognition of the need to address geographical domain shifts

03

Encouragement of innovative semi-supervised techniques

Abstract

Acoustic scene classification (ASC) is a crucial research problem in computational auditory scene analysis, and it aims to recognize the unique acoustic characteristics of an environment. One of the challenges of the ASC task is the domain shift between training and testing data. Since 2018, ASC challenges have focused on the generalization of ASC models across different recording devices. Although this task, in recent years, has achieved substantial progress in device generalization, the challenge of domain shift between different geographical regions, involving discrepancies such as time, space, culture, and language, remains insufficiently explored at present. In addition, considering the abundance of unlabeled acoustic scene data in the real world, it is important to study the possible ways to utilize these unlabelled data. Therefore, we introduce the task Semi-supervised Acoustic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jishengbai/icme2024asc
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Music and Audio Processing · Speech Recognition and Synthesis