Unsupervised adversarial domain adaptation for acoustic scene   classification

Shayan Gharib; Konstantinos Drossos; Emre \c{C}akir; Dmitriy Serdyuk,; and Tuomas Virtanen

arXiv:1808.05777·eess.AS·August 20, 2018·44 cites

Unsupervised adversarial domain adaptation for acoustic scene classification

Shayan Gharib, Konstantinos Drossos, Emre \c{C}akir, Dmitriy Serdyuk,, and Tuomas Virtanen

PDF

Open Access 1 Repo

TL;DR

This paper introduces the first unsupervised adversarial domain adaptation method for acoustic scene classification, significantly improving accuracy on unseen, unlabeled data with minimal impact on labeled data.

Contribution

It presents a novel unsupervised adversarial approach for domain adaptation in acoustic scene classification, addressing mismatched conditions without requiring labels for target data.

Findings

01

Achieved approximately 10% increase in accuracy on unseen, unlabeled data.

02

Maintained nearly the same performance on labeled source data.

03

Demonstrated effectiveness using DCASE 2018 dataset with mismatched recording devices.

Abstract

A general problem in acoustic scene classification task is the mismatched conditions between training and testing data, which significantly reduces the performance of the developed methods on classification accuracy. As a countermeasure, we present the first method of unsupervised adversarial domain adaptation for acoustic scene classification. We employ a model pre-trained on data from one set of conditions and by using data from other set of conditions, we adapt the model in order that its output cannot be used for classifying the set of conditions that input data belong to. We use a freely available dataset from the DCASE 2018 challenge Task 1, subtask B, that contains data from mismatched recording devices. We consider the scenario where the annotations are available for the data recorded from one device, but not for the rest. Our results show that with our model agnostic method we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

shayangharib/AUDASC
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Geophysical Methods and Applications · Speech and Audio Processing