Towards Localizing Conversation Partners using Head Motion
Payal Mohapatra, Calvin Murdock, Ali Aroudi, Ishwarya Ananthabhotla, Anjali Menon, Buye Xu, Morteza Khaleghimeybodi

TL;DR
This paper presents HALo, a head-orientation-based system using IMUs on smartglasses to localize conversation partners and improve speech enhancement in noisy environments.
Contribution
It introduces HALo and CoCo, novel methods leveraging head motion data for acoustic zone localization and conversation partner counting, outperforming existing approaches.
Findings
HALo improves localization performance by 21% with prior knowledge of speaker count.
CoCo achieves 0.74 accuracy in classifying number of conversation partners.
Head-orientation cues significantly enhance speech enhancement in noisy multi-speaker settings.
Abstract
Many individuals struggle to understand conversation partners in noisy settings, particularly amid background speakers or due to hearing impairments. Emerging wearables like smartglasses offer a transformative opportunity to enhance speech from conversation partners. Crucial to this is identifying the direction in which the user wants to listen, which we refer to as the user's acoustic zones of interest. While current spatial audio-based methods can resolve the direction of vocal input, they are agnostic to listening preferences and have limited functionality in noisy settings with interfering speakers. To address this, behavioral cues are needed to actively infer a user's acoustic zones of interest. We explore the effectiveness of head-orienting behavior, captured by Inertial Measurement Units (IMUs) on smartglasses, as a modality for localizing these zones in seated conversations. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
