Acoustically-Driven Phoneme Removal That Preserves Vocal Affect Cues

Camille Noufi; Jonathan Berger; Karen J. Parker; Daniel L. Bowling

arXiv:2210.15001·eess.AS·March 15, 2023

Acoustically-Driven Phoneme Removal That Preserves Vocal Affect Cues

Camille Noufi, Jonathan Berger, Karen J. Parker, Daniel L. Bowling

PDF

Open Access

TL;DR

This paper introduces a novel acoustical method combining speech and EGG signals to remove linguistic content from speech, enabling the study of vocal affect without language interference, useful for clinical assessments.

Contribution

The proposed technique uniquely isolates paralinguistic affect cues by removing phonetic information while preserving vocal affect signals, using combined speech and EGG data.

Findings

01

High similarity in affect perception between original and transformed signals

02

Effective removal of linguistic content while retaining affect cues

03

Potential applications in clinical affect sensitivity testing

Abstract

In this paper, we propose a method for removing linguistic information from speech for the purpose of isolating paralinguistic indicators of affect. The immediate utility of this method lies in clinical tests of sensitivity to vocal affect that are not confounded by language, which is impaired in a variety of clinical populations. The method is based on simultaneous recordings of speech audio and electroglottographic (EGG) signals. The speech audio signal is used to estimate the average vocal tract filter response and amplitude envelop. The EGG signal supplies a direct correlate of voice source activity that is mostly independent of phonetic articulation. The dynamic energy of the speech audio and the average vocal tract filter are applied to the EGG signal create a third signal designed to capture as much paralinguistic information from the vocal production system as possible --…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Voice and Speech Disorders · Speech Recognition and Synthesis