Cross-Corpora Spoken Language Identification with Domain Diversification and Generalization
Spandan Dey, Md Sahidullah, Goutam Saha

TL;DR
This paper tackles the challenge of improving cross-corpora spoken language identification by proposing domain diversification and generalization techniques, demonstrating significant performance gains on Indian LID datasets and in-the-wild evaluations.
Contribution
It introduces novel domain diversification and generalization methods, including maximally diversity-aware cascaded augmentations and pseudo-domain approaches, enhancing cross-corpora LID performance.
Findings
Domain diversification outperforms simple augmentation methods.
Domain generalization is more effective than diversification for cross-corpora.
Domain-invariant learning benefits cross-corpora generalization, while domain-aware suits same-corpora.
Abstract
This work addresses the cross-corpora generalization issue for the low-resourced spoken language identification (LID) problem. We have conducted the experiments in the context of Indian LID and identified strikingly poor cross-corpora generalization due to corpora-dependent non-lingual biases. Our contribution to this work is twofold. First, we propose domain diversification, which diversifies the limited training data using different audio data augmentation methods. We then propose the concept of maximally diversity-aware cascaded augmentations and optimize the augmentation fold-factor for effective diversification of the training data. Second, we introduce the idea of domain generalization considering the augmentation methods as pseudo-domains. Towards this, we investigate both domain-invariant and domain-aware approaches. Our LID system is based on the state-of-the-art emphasized…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsTest
