A Metric to Classify Style of Spoken Speech

Sunil Kopparapu; Saurabh Bhatnagar; K. Sahana; Sathyanarayana,; Akhilesh Srivastava; P.V.S. Rao

arXiv:1504.01427·cs.CL·April 8, 2015

A Metric to Classify Style of Spoken Speech

Sunil Kopparapu, Saurabh Bhatnagar, K. Sahana, Sathyanarayana,, Akhilesh Srivastava, P.V.S. Rao

PDF

Open Access

TL;DR

This paper introduces a new metric for classifying spoken speech style, which outperforms human experts in accuracy, aiding BPOs in identifying speakers with specific accents efficiently.

Contribution

The paper presents a novel, robust metric for classifying speech style that reduces human bias and improves accuracy in BPO applications.

Findings

01

The metric outperforms two human experts in classification accuracy.

02

The system was tested on speech data from over seventy BPO employees.

03

Experimental results demonstrate the metric's robustness and reliability.

Abstract

The ability to classify spoken speech based on the style of speaking is an important problem. With the advent of BPO's in recent times, specifically those that cater to a population other than the local population, it has become necessary for BPO's to identify people with certain style of speaking (American, British etc). Today BPO's employ accent analysts to identify people having the required style of speaking. This process while involving human bias, it is becoming increasingly infeasible because of the high attrition rate in the BPO industry. In this paper, we propose a new metric, which robustly and accurately helps classify spoken speech based on the style of speaking. The role of the proposed metric is substantiated by using it to classify real speech data collected from over seventy different people working in a BPO. We compare the performance of the metric against human experts…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing