Multi-Module G2P Converter for Persian Focusing on Relations between Words
Mahdi Rezaei, Negar Nayeri, Saeed Farzi, Hossein Sameti

TL;DR
This paper presents a multi-module G2P conversion system for Persian that leverages cross-word relations and sequence-level modeling to improve accuracy and speed over end-to-end approaches.
Contribution
The proposed multi-module system introduces a novel sequence-level approach using GRU and Transformer models to handle homographs, OOVs, and ezafe without pre-processing.
Findings
Achieved 94.48% word-level accuracy
Outperformed previous Persian G2P systems
Effective cross-word relation modeling
Abstract
In this paper, we investigate the application of end-to-end and multi-module frameworks for G2P conversion for the Persian language. The results demonstrate that our proposed multi-module G2P system outperforms our end-to-end systems in terms of accuracy and speed. The system consists of a pronunciation dictionary as our look-up table, along with separate models to handle homographs, OOVs and ezafe in Persian created using GRU and Transformer architectures. The system is sequence-level rather than word-level, which allows it to effectively capture the unwritten relations between words (cross-word information) necessary for homograph disambiguation and ezafe recognition without the need for any pre-processing. After evaluation, our system achieved a 94.48% word-level accuracy, outperforming the previous G2P systems for Persian.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Peer-to-Peer Network Technologies · Web Data Mining and Analysis
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Softmax · Position-Wise Feed-Forward Layer · Layer Normalization · Adam · Byte Pair Encoding · Label Smoothing · Residual Connection
