Source-Free Domain Adaptation for Speaker Verification in Data-Scarce Languages and Noisy Channels

Shlomo Salo Elia; Aviad Malachi; Vered Aharonson; Gadi Pinkas

arXiv:2406.05863·cs.SD·June 11, 2024

TL;DR

This paper proposes source-free domain adaptation techniques for speaker verification in data-scarce languages and noisy channels, addressing privacy and resource limitations by exploring fine-tuning and a new cluster-learn algorithm.

Contribution

It introduces a novel iterative cluster-learn algorithm and evaluates fine-tuning methods for source-free adaptation in challenging speaker verification scenarios.

Findings

01. Fine-tuning improves performance with limited target data.

02. The cluster-learn algorithm effectively adapts to unlabeled target data.

03. Methods outperform baseline approaches in noisy and language-mismatched conditions.

Abstract

Domain adaptation is often hampered by exceedingly small target datasets and inaccessible source data. These conditions are prevalent in speech verification, where privacy policies and/or languages with scarce speech resources limit the availability of sufficient data. This paper explored techniques of source-free domain adaptation to a limited target speech dataset for speaker verification in data-scarce languages. Both language and channel mismatch between source and target were investigated. Fine-tuning methods were evaluated and compared across different sizes of labeled target data. A novel iterative cluster-learn algorithm was studied for unlabeled target datasets.
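
The abstract names an iterative cluster-learn algorithm for unlabeled target data but does not spell out its steps. The sketch below only illustrates the general idea of alternating between clustering embeddings into pseudo-speakers and re-learning from those pseudo-labels. It is not the authors' method. Everything in it is an assumption made for illustration: k-means stands in for the clustering step, a within-class covariance whitening of the embeddings stands in for the learning step (a real system would more likely fine-tune the embedding network), and all function names (`cluster_learn`, `learn_projection`, and so on) are hypothetical.

```python
import numpy as np


def farthest_point_init(x, k):
    """Pick k starting centroids deterministically: begin at x[0], then
    repeatedly take the point farthest from every centroid chosen so far."""
    chosen = [0]
    dist = ((x - x[0]) ** 2).sum(axis=1)
    for _ in range(k - 1):
        nxt = int(dist.argmax())
        chosen.append(nxt)
        dist = np.minimum(dist, ((x - x[nxt]) ** 2).sum(axis=1))
    return x[chosen].copy()


def kmeans(x, k, iters=20):
    """Plain Lloyd's k-means; returns one pseudo-label per row of x."""
    centroids = farthest_point_init(x, k)
    labels = np.zeros(len(x), dtype=int)
    for _ in range(iters):
        sq_dist = ((x[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        labels = sq_dist.argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):  # leave an empty cluster's centroid alone
                centroids[j] = x[labels == j].mean(axis=0)
    return labels


def within_class_cov(x, labels):
    """Pooled covariance of x around the mean of each pseudo-speaker."""
    dim = x.shape[1]
    cov = np.zeros((dim, dim))
    for j in np.unique(labels):
        centered = x[labels == j] - x[labels == j].mean(axis=0)
        cov += centered.T @ centered
    return cov / len(x)


def learn_projection(x, labels, ridge=1e-6):
    """Assumed stand-in for the 'learn' step: a projection that whitens
    the within-class covariance implied by the pseudo-labels."""
    sw = within_class_cov(x, labels) + ridge * np.eye(x.shape[1])
    # Cholesky factor B with B @ B.T == inv(sw), so cov(x @ B) is ~identity.
    return np.linalg.cholesky(np.linalg.inv(sw))


def cluster_learn(x, n_clusters, rounds=3):
    """Alternate clustering (pseudo-labels) and learning (projection)."""
    proj = np.eye(x.shape[1])
    labels = None
    for _ in range(rounds):
        labels = kmeans(x @ proj, n_clusters)  # cluster current embeddings
        proj = learn_projection(x, labels)     # re-learn from pseudo-labels
    return proj, labels
```

Calling `cluster_learn(embeddings, n_clusters=k)` on an array of shape `(n_utterances, dim)` returns the learned projection and the final pseudo-labels. The number of clusters is assumed to be known here, which may not hold for a real unlabeled target set.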

Taxonomy

Topics: Speech Recognition and Synthesis · Speech and Audio Processing