Leveraging Native Language Speech for Accent Identification using Deep   Siamese Networks

Aditya Siddhant; Preethi Jyothi; Sriram Ganapathy

arXiv:1712.08992·cs.CL·June 20, 2018

Leveraging Native Language Speech for Accent Identification using Deep Siamese Networks

Aditya Siddhant, Preethi Jyothi, Sriram Ganapathy

PDF

TL;DR

This paper introduces a deep Siamese network approach for accent identification that leverages native language speech data, achieving significant accuracy improvements over traditional methods.

Contribution

It presents a novel deep Siamese network model trained on native and accented speech to improve accent identification accuracy.

Findings

01

15.4% relative performance improvement over baseline

02

Effective use of native language speech data

03

Detailed error analysis provided

Abstract

The problem of automatic accent identification is important for several applications like speaker profiling and recognition as well as for improving speech recognition systems. The accented nature of speech can be primarily attributed to the influence of the speaker's native language on the given speech recording. In this paper, we propose a novel accent identification system whose training exploits speech in native languages along with the accented speech. Specifically, we develop a deep Siamese network-based model which learns the association between accented speech recordings and the native language speech recordings. The Siamese networks are trained with i-vector features extracted from the speech recordings using either an unsupervised Gaussian mixture model (GMM) or a supervised deep neural network (DNN) model. We perform several accent identification experiments using the CSLU…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.