How Does That Sound? Multi-Language SpokenName2Vec Algorithm Using Speech Generation and Deep Learning
Aviad Elyashar, Rami Puzis, Michael Fire

TL;DR
This paper introduces SpokenName2Vec, a deep learning-based method that generates speech-based name embeddings to improve similar name suggestion accuracy across languages and accents.
Contribution
It presents a novel approach using speech generation and deep learning for name similarity detection, outperforming existing phonetic and string similarity algorithms.
Findings
Outperforms 10 existing algorithms in name similarity tasks.
Effective across multiple languages and accents.
Demonstrated on large-scale dataset with 250,000 names.
Abstract
Searching for information about a specific person is an online activity frequently performed by many users. In most cases, users are aided by queries containing a name and sending back to the web search engines for finding their will. Typically, Web search engines provide just a few accurate results associated with a name-containing query. Currently, most solutions for suggesting synonyms in online search are based on pattern matching and phonetic encoding, however very often, the performance of such solutions is less than optimal. In this paper, we propose SpokenName2Vec, a novel and generic approach which addresses the similar name suggestion problem by utilizing automated speech generation, and deep learning to produce spoken name embeddings. This sophisticated and innovative embeddings captures the way people pronounce names in any language and accent. Utilizing the name…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Authorship Attribution and Profiling · Natural Language Processing Techniques
