PolyIPA -- Multilingual Phoneme-to-Grapheme Conversion Model

Davor Lauc

arXiv:2412.09102·cs.CL·December 13, 2024

PolyIPA -- Multilingual Phoneme-to-Grapheme Conversion Model

Davor Lauc

PDF

Open Access

TL;DR

PolyIPA is a multilingual phoneme-to-grapheme conversion model that improves transliteration accuracy across languages using auxiliary models and beam search, benefiting onomastic research and information retrieval.

Contribution

The paper introduces PolyIPA, a novel multilingual conversion model with auxiliary models for data augmentation, achieving state-of-the-art performance in cross-linguistic phoneme-to-grapheme tasks.

Findings

01

Achieves a mean CER of 0.055 across multiple languages.

02

Character-level BLEU score of 0.914 indicating high translation quality.

03

Top-3 candidates reduce error rate by 52.7%.

Abstract

This paper presents PolyIPA, a novel multilingual phoneme-to-grapheme conversion model designed for multilingual name transliteration, onomastic research, and information retrieval. The model leverages two helper models developed for data augmentation: IPA2vec for finding soundalikes across languages, and similarIPA for handling phonetic notation variations. Evaluated on a test set that spans multiple languages and writing systems, the model achieves a mean Character Error Rate of 0.055 and a character-level BLEU score of 0.914, with particularly strong performance on languages with shallow orthographies. The implementation of beam search further improves practical utility, with top-3 candidates reducing the effective error rate by 52.7\% (to CER: 0.026), demonstrating the model's effectiveness for cross-linguistic applications.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Phonetics and Phonology Research · Natural Language Processing Techniques

MethodsSparse Evolutionary Training