Learning Intonation Pattern Embeddings for Arabic Dialect Identification

Aitor Arronte Alvarez; Elsayed Sabry Abdelaal Issa

arXiv:2008.00667·eess.AS·August 4, 2020·Interspeech

Learning Intonation Pattern Embeddings for Arabic Dialect Identification

Aitor Arronte Alvarez, Elsayed Sabry Abdelaal Issa

PDF

TL;DR

This paper introduces an end-to-end system using intonation patterns and hybrid neural networks for Arabic Dialect Identification, achieving state-of-the-art results with robustness to limited data.

Contribution

It demonstrates that intonation patterns alone can effectively identify Arabic dialects, outperforming systems relying on multiple features, and emphasizes the sufficiency of information in acoustic modeling.

Findings

01

Achieved state-of-the-art results on VarDial 17 ADI dataset.

02

Intonation patterns alone provide sufficient information for dialect identification.

03

The proposed pipeline is robust to data sparsity.

Abstract

This article presents a full end-to-end pipeline for Arabic Dialect Identification (ADI) using intonation patterns and acoustic representations. Recent approaches to language and dialect identification use linguistic-aware deep architectures that are able to capture phonetic differences amongst languages and dialects. Specifically, in ADI tasks, different combinations of linguistic features and acoustic representations have been successful with deep learning models. The approach presented in this article uses intonation patterns and hybrid residual and bidirectional LSTM networks to learn acoustic embeddings with no additional linguistic information. Results of the experiments show that intonation patterns for Arabic dialects provide sufficient information to achieve state-of-the-art results on the VarDial 17 ADI dataset, outperforming single-feature systems. The pipeline presented is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.