The Recognition Of Persian Phonemes Using PPNet

Saber Malekzadeh; Mohammad Hossein Gholizadeh; Hossein Ghayoumi zadeh,; Seyed Naser Razavi

arXiv:1812.08600·eess.AS·March 24, 2020

The Recognition Of Persian Phonemes Using PPNet

Saber Malekzadeh, Mohammad Hossein Gholizadeh, Hossein Ghayoumi zadeh,, Seyed Naser Razavi

PDF

Open Access

TL;DR

This paper introduces PPNet, a new deep convolutional neural network architecture, achieving a 75.87% accuracy in recognizing Persian phonemes, surpassing previous methods and enhancing speech recognition systems.

Contribution

The paper presents PPNet, a novel deep learning model specifically designed for Persian phoneme recognition, with improved accuracy over existing algorithms.

Findings

01

Achieved 75.87% accuracy on Persian phoneme recognition

02

Used STFT features with deep CNN architecture

03

Outperformed previous algorithms on PCVC dataset

Abstract

In this paper, a novel approach is proposed for the recognition of Persian phonemes in the Persian Consonant-Vowel Combination (PCVC) speech dataset. Nowadays, deep neural networks play a crucial role in classification tasks. However, the best results in speech recognition are not yet as perfect as human recognition rate. Deep learning techniques show outstanding performance over many other classification tasks like image classification, document classification, etc. Furthermore, the performance is sometimes better than a human. The reason why automatic speech recognition (ASR) systems are not as qualified as the human speech recognition system, mostly depends on features of data which is fed to deep neural networks. Methods: In this research, firstly, the sound samples are cut for the exact extraction of phoneme sounds in 50ms samples. Then, phonemes are divided into 30 groups,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Music and Audio Processing