Hybrid Handcrafted and Learnable Audio Representation for Analysis of   Speech Under Cognitive and Physical Load

Gasser Elbanna; Alice Biryukov; Neil Scheidwasser-Clow; Lara Orlandic,; Pablo Mainar; Mikolaj Kegler; Pierre Beckmann; Milos Cernak

arXiv:2203.16637·cs.SD·October 26, 2022

Hybrid Handcrafted and Learnable Audio Representation for Analysis of Speech Under Cognitive and Physical Load

Gasser Elbanna, Alice Biryukov, Neil Scheidwasser-Clow, Lara Orlandic,, Pablo Mainar, Mikolaj Kegler, Pierre Beckmann, Milos Cernak

PDF

1 Repo

TL;DR

This paper presents a novel hybrid audio representation combining handcrafted DSP features and deep neural network learning, improving stress detection in speech under cognitive and physical load.

Contribution

It introduces a new self-supervised audio representation that outperforms existing handcrafted and DNN-based methods for stress detection in speech.

Findings

01

Hybrid representation outperforms traditional DSP features.

02

Hybrid approach surpasses pure DNN-based representations.

03

New datasets for task load detection in speech are provided.

Abstract

As a neurophysiological response to threat or adverse conditions, stress can affect cognition, emotion and behaviour with potentially detrimental effects on health in the case of sustained exposure. Since the affective content of speech is inherently modulated by an individual's physical and mental state, a substantial body of research has been devoted to the study of paralinguistic correlates of stress-inducing task load. Historically, voice stress analysis (VSA) has been conducted using conventional digital signal processing (DSP) techniques. Despite the development of modern methods based on deep neural networks (DNNs), accurately detecting stress in speech remains difficult due to the wide variety of stressors and considerable variability in the individual stress perception. To that end, we introduce a set of five datasets for task load detection in speech. The voice recordings were…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gasserelbanna/serab-byols
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.