Exploring rhythm formant analysis for Indic language classification
Parismita Gogoi, Sishir Kalita, Priyankoo Sarmah, S.R, Mahadeva Prasanna

TL;DR
This study investigates the use of rhythm formant analysis in the frequency domain to classify five Indian languages, showing that spectral features and temporal rhythm patterns improve classification accuracy.
Contribution
It introduces a quantitative rhythm analysis method using low-frequency spectral features for Indian language classification, demonstrating its effectiveness.
Findings
Spectral features outperform direct R-formant measures.
Temporal rhythm patterns enhance language discrimination.
Achieved approximately 69% accuracy in classifying five languages.
Abstract
This paper reports a preliminary study on quantitative frequency domain rhythm cues for classifying five Indian languages: Bengali, Kannada, Malayalam, Marathi, and Tamil. We employ rhythm formant (R-formants) analysis, a technique introduced by Gibbon that utilizes low-frequency spectral analysis of amplitude modulation and frequency modulation envelopes to characterize speech rhythm. Various measures are computed from the LF spectrum, including R-formants, discrete cosine transform-based measures, and spectral measures. Results show that threshold-based and spectral features outperform directly computed R-formants. Temporal pattern of rhythm derived from LF spectrograms provides better language-discriminating cues. Combining all derived features we achieve an accuracy of 69.21% and a weighted F1 score of 69.18% in classifying the five languages. This study demonstrates the potential…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Language and cultural evolution
