How Familiar Does That Sound? Cross-Lingual Representational Similarity   Analysis of Acoustic Word Embeddings

Badr M. Abdullah; Iuliia Zaitova; Tania Avgustinova; Bernd M\"obius,; Dietrich Klakow

arXiv:2109.10179·cs.CL·September 22, 2021

How Familiar Does That Sound? Cross-Lingual Representational Similarity Analysis of Acoustic Word Embeddings

Badr M. Abdullah, Iuliia Zaitova, Tania Avgustinova, Bernd M\"obius,, Dietrich Klakow

PDF

Open Access 1 Repo

TL;DR

This study uses representational similarity analysis to examine how neural network speech models trained on different languages perceive unknown languages, revealing that typological similarity influences their internal representations.

Contribution

Introduces a novel RSA-based method to analyze cross-lingual acoustic word embeddings and demonstrates the impact of language typology on neural speech representations.

Findings

01

Typological similarity affects model representational similarity.

02

RSA effectively quantifies cross-lingual speech processing.

03

Language similarity influences neural network speech perception.

Abstract

How do neural networks "perceive" speech sounds from unknown languages? Does the typological similarity between the model's training language (L1) and an unknown language (L2) have an impact on the model representations of L2 speech signals? To answer these questions, we present a novel experimental design based on representational similarity analysis (RSA) to analyze acoustic word embeddings (AWEs) -- vector representations of variable-duration spoken-word segments. First, we train monolingual AWE models on seven Indo-European languages with various degrees of typological similarity. We then employ RSA to quantify the cross-lingual similarity by simulating native and non-native spoken-word processing using AWEs. Our experiments show that typological similarity indeed affects the representational similarity of the models in our study. We further discuss the implications of our work on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

uds-lsv/xrsa-awes
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Topic Modeling · Natural Language Processing Techniques