An Active Machine Hearing System for Auditory Stream Segregation

Christopher Schymura; Thomas Walther; Dorothea Kolossa

arXiv:1606.07598·cs.SD·June 27, 2016

TL;DR

This paper presents a binaural machine hearing system that performs auditory stream segregation using probabilistic clustering of sound source locations, incorporating head movements to enhance performance in complex auditory scenes.

Contribution

The paper introduces a probabilistic framework, based on mixtures of von Mises distributions, for joint localization and segregation of sound sources, mimicking the way human listeners group sounds into auditory streams.

Findings

01 Effective segregation of multiple sound sources in complex scenes

02 Improved localization accuracy with head movements

03 Robust performance with speech and non-speech sounds

Abstract

This study describes a binaural machine hearing system that is capable of performing auditory stream segregation in scenarios where multiple sound sources are present. The process of stream segregation refers to the capability of human listeners to group acoustic signals into sets of distinct auditory streams, corresponding to individual sound sources. The proposed computational framework mimics this ability via a probabilistic clustering scheme for joint localization and segregation. This scheme is based on mixtures of von Mises distributions to model the angular positions of the sound sources surrounding the listener. The distribution parameters are estimated using block-wise processing of auditory cues extracted from binaural signals. Additionally, the proposed system can conduct rotational head movements to improve localization and stream segregation performance. Evaluation of the…
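
The clustering idea in the abstract can be illustrated with a small sketch. The Python code below fits a mixture of von Mises distributions to a set of azimuth observations using a plain EM loop, so that each mixture component stands for one candidate sound source. It is not the authors' implementation: the function names, the farthest-point initialization, the fixed iteration count, and the closed-form approximation used for the concentration parameter are all assumptions made for illustration. The extraction of azimuth observations from binaural cues and the handling of head rotations are not covered.

```python
import numpy as np


def von_mises_logpdf(theta, mu, kappa):
    """Log-density of a von Mises distribution on the circle (radians)."""
    return kappa * np.cos(theta - mu) - np.log(2.0 * np.pi * np.i0(kappa))


def circular_distance(a, b):
    """Absolute angular distance between a and b, in [0, pi]."""
    return np.abs(np.angle(np.exp(1j * (a - b))))


def fit_von_mises_mixture(theta, n_components, n_iter=100):
    """Hypothetical helper: EM for a von Mises mixture over angles."""
    theta = np.asarray(theta, dtype=float)

    # Assumed initialization: greedy farthest-point selection of means.
    mu = [theta[0]]
    for _ in range(1, n_components):
        d = np.min([circular_distance(theta, m) for m in mu], axis=0)
        mu.append(theta[np.argmax(d)])
    mu = np.array(mu)
    kappa = np.full(n_components, 5.0)
    weights = np.full(n_components, 1.0 / n_components)

    for _ in range(n_iter):
        # E-step: responsibilities, computed stably in the log domain.
        log_r = np.log(np.maximum(weights, 1e-12)) + von_mises_logpdf(
            theta[:, None], mu, kappa
        )
        log_r -= log_r.max(axis=1, keepdims=True)
        resp = np.exp(log_r)
        resp /= resp.sum(axis=1, keepdims=True)

        # M-step: weighted circular mean and mean resultant length.
        n_k = np.maximum(resp.sum(axis=0), 1e-12)
        c = (resp * np.cos(theta)[:, None]).sum(axis=0)
        s = (resp * np.sin(theta)[:, None]).sum(axis=0)
        weights = n_k / theta.size
        mu = np.arctan2(s, c)
        r_bar = np.minimum(np.sqrt(c**2 + s**2) / n_k, 1.0 - 1e-9)
        # Assumed approximation of the concentration from r_bar,
        # clipped so that np.i0 stays finite.
        kappa = np.clip(r_bar * (2.0 - r_bar**2) / (1.0 - r_bar**2), 1e-3, 500.0)

    return weights, mu, kappa, resp


# Synthetic example: two sources at about -57 and +46 degrees azimuth.
rng = np.random.default_rng(0)
angles = np.concatenate([rng.vonmises(-1.0, 20.0, 500),
                         rng.vonmises(0.8, 20.0, 500)])
weights, mu, kappa, resp = fit_von_mises_mixture(angles, n_components=2)
labels = resp.argmax(axis=1)  # hard assignment of each observation to a source
```

In this sketch the estimated means play the role of source directions, and the responsibilities give a soft assignment of each observation to a source, which is one simple way to read "joint localization and segregation".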

Taxonomy

Topics: Speech and Audio Processing · Music and Audio Processing · Advanced Adaptive Filtering Techniques