Hybrid Neural Networks for On-device Directional Hearing

Anran Wang; Maruchi Kim; Hao Zhang; Shyamnath Gollakota

arXiv:2112.05893·cs.SD·December 14, 2021

Hybrid Neural Networks for On-device Directional Hearing

Anran Wang, Maruchi Kim, Hao Zhang, Shyamnath Gollakota

PDF

1 Repo 1 Video

TL;DR

This paper introduces DeepBeam, a hybrid neural network model that combines traditional beamformers with lightweight neural nets to enable real-time, low-latency directional hearing on wearable devices, achieving significant reductions in computational resources.

Contribution

DeepBeam is a novel hybrid model that improves efficiency and generalizability for on-device directional hearing by combining traditional beamformers with a custom neural network.

Findings

01

Comparable performance to state-of-the-art models on synthetic data

02

5x reduction in model size and computation

03

Real-time operation at 8 ms on low-power CPUs

Abstract

On-device directional hearing requires audio source separation from a given direction while achieving stringent human-imperceptible latency requirements. While neural nets can achieve significantly better performance than traditional beamformers, all existing models fall short of supporting low-latency causal inference on computationally-constrained wearables. We present DeepBeam, a hybrid model that combines traditional beamformers with a custom lightweight neural net. The former reduces the computational burden of the latter and also improves its generalizability, while the latter is designed to further reduce the memory and computational overhead to enable real-time and low-latency operations. Our evaluation shows comparable performance to state-of-the-art causal inference models on synthetic data while achieving a 5x reduction of model size, 4x reduction of computation per second,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wanganran/HybridBeam
noneOfficial

Videos

Hybrid Neural Networks for On-Device Directional Hearing· underline