Hybrid Approach to English-Hindi Name Entity Transliteration

Shruti Mathur; Varun Prakash Saxena

arXiv:1403.7455·cs.CL·March 31, 2014·5 cites

Hybrid Approach to English-Hindi Name Entity Transliteration

Shruti Mathur, Varun Prakash Saxena

PDF

Open Access

TL;DR

This paper presents a hybrid rule-based and statistical method for English-Hindi name entity transliteration, achieving 83.40% accuracy in converting English names to Hindi.

Contribution

It introduces a novel hybrid approach combining rule-based phoneme extraction with statistical translation for improved transliteration accuracy.

Findings

01

Achieved 83.40% transliteration accuracy

02

Demonstrated effectiveness of hybrid approach in Indian language MT

03

Improved transliteration over existing methods

Abstract

Machine translation (MT) research in Indian languages is still in its infancy. Not much work has been done in proper transliteration of name entities in this domain. In this paper we address this issue. We have used English-Hindi language pair for our experiments and have used a hybrid approach. At first we have processed English words using a rule based approach which extracts individual phonemes from the words and then we have applied statistical approach which converts the English into its equivalent Hindi phoneme and in turn the corresponding Hindi word. Through this approach we have attained 83.40% accuracy.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Speech Recognition and Synthesis · Topic Modeling