Do Acoustic Word Embeddings Capture Phonological Similarity? An   Empirical Study

Badr M. Abdullah; Marius Mosbach; Iuliia Zaitova; Bernd M\"obius,; Dietrich Klakow

arXiv:2106.08686·cs.CL·June 17, 2021

Do Acoustic Word Embeddings Capture Phonological Similarity? An Empirical Study

Badr M. Abdullah, Marius Mosbach, Iuliia Zaitova, Bernd M\"obius,, Dietrich Klakow

PDF

1 Repo

TL;DR

This study investigates whether acoustic word embeddings truly reflect phonological similarity, revealing only moderate correlation and challenging current evaluation methods for AWEs.

Contribution

It provides an empirical analysis of neural AWE models across languages, showing their limitations in capturing phonological relationships.

Findings

01

Embedding distances only moderately correlate with phonological dissimilarity.

02

Improved word discrimination performance does not guarantee better phonological similarity modeling.

03

Current intrinsic evaluation methods for AWEs may need reconsideration.

Abstract

Several variants of deep neural networks have been successfully employed for building parametric models that project variable-duration spoken word segments onto fixed-size vector representations, or acoustic word embeddings (AWEs). However, it remains unclear to what degree we can rely on the distance in the emerging AWE space as an estimate of word-form similarity. In this paper, we ask: does the distance in the acoustic embedding space correlate with phonological dissimilarity? To answer this question, we empirically investigate the performance of supervised approaches for AWEs with different neural architectures and learning objectives. We train AWE models in controlled settings for two languages (German and Czech) and evaluate the embeddings on two tasks: word discrimination and phonological similarity. Our experiments show that (1) the distance in the embedding space in the best…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

uds-lsv/AWEs_phon_sim
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.