Subjective and objective experiments on the influence of speaker's   gender on the unvoiced segments

A Madhavaraj; T V Ananthapadmanabha; A G Ramakrishnan

arXiv:1807.05813·cs.SD·July 17, 2018

Subjective and objective experiments on the influence of speaker's gender on the unvoiced segments

A Madhavaraj, T V Ananthapadmanabha, A G Ramakrishnan

PDF

Open Access

TL;DR

This study investigates how a speaker's gender affects unvoiced speech sounds through subjective and objective experiments, revealing that unvoiced segments are largely speaker-independent and not strongly influenced by gender.

Contribution

It provides new evidence that unvoiced speech segments are minimally affected by speaker gender, using both perceptual tests and HMM-based recognition experiments.

Findings

01

Humans cannot reliably distinguish gender-altered unvoiced segments.

02

Recognition performance is only slightly affected when testing across genders.

03

Unvoiced sounds are largely speaker-independent in terms of gender influence.

Abstract

Subjective and objective experiments are conducted to understand the extent to which a speaker's gender influences the acoustics of unvoiced (U) sounds. U segments of utterances are replaced by the corresponding segments of a speaker of opposite gender to prepare modified utterances. Humans are asked to judge if the modified utterance is spoken by one or two speakers. The experiments show that human subjects are unable to distinguish the modified from the original. Thus, listeners are able to identify the U segments irrespective of the gender, which may be based on some speaker-independent invariant acoustic cues. To test if this finding is purely a perceptual phenomenon, objective experiments are also conducted. Gender specific HMM based phoneme recognition systems are trained using the TIMIT training set and tested on (a) utterances spoken by the same gender (b) utterances spoken by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Speech Recognition and Synthesis · Music and Audio Processing