The Recognition Of Persian Phonemes Using PPNet
Saber Malekzadeh, Mohammad Hossein Gholizadeh, Hossein Ghayoumi zadeh,, Seyed Naser Razavi

TL;DR
This paper introduces PPNet, a new deep convolutional neural network architecture, achieving a 75.87% accuracy in recognizing Persian phonemes, surpassing previous methods and enhancing speech recognition systems.
Contribution
The paper presents PPNet, a novel deep learning model specifically designed for Persian phoneme recognition, with improved accuracy over existing algorithms.
Findings
Achieved 75.87% accuracy on Persian phoneme recognition
Used STFT features with deep CNN architecture
Outperformed previous algorithms on PCVC dataset
Abstract
In this paper, a novel approach is proposed for the recognition of Persian phonemes in the Persian Consonant-Vowel Combination (PCVC) speech dataset. Nowadays, deep neural networks play a crucial role in classification tasks. However, the best results in speech recognition are not yet as perfect as human recognition rate. Deep learning techniques show outstanding performance over many other classification tasks like image classification, document classification, etc. Furthermore, the performance is sometimes better than a human. The reason why automatic speech recognition (ASR) systems are not as qualified as the human speech recognition system, mostly depends on features of data which is fed to deep neural networks. Methods: In this research, firstly, the sound samples are cut for the exact extraction of phoneme sounds in 50ms samples. Then, phonemes are divided into 30 groups,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Music and Audio Processing
