"OK Aura, Be Fair With Me": Demographics-Agnostic Training for Bias Mitigation in Wake-up Word Detection

Fernando L\'opez; Paula Delgado-Santos; Pablo G\'omez; David Solans; Jordi Luque

arXiv:2604.05830·cs.CL·April 8, 2026

"OK Aura, Be Fair With Me": Demographics-Agnostic Training for Bias Mitigation in Wake-up Word Detection

Fernando L\'opez, Paula Delgado-Santos, Pablo G\'omez, David Solans, Jordi Luque

PDF

TL;DR

This paper investigates demographics-agnostic training methods to reduce bias in wake-up word detection, demonstrating significant improvements in fairness across sex, age, and accent groups using the OK Aura dataset.

Contribution

It introduces label-agnostic training techniques, including data augmentation and knowledge distillation, to mitigate demographic biases in wake-up word detection models.

Findings

01

Demographic bias reduced by up to 83.65% in age groups.

02

Data augmentation and knowledge distillation improve fairness.

03

Significant bias reduction compared to baseline models.

Abstract

Voice-based interfaces are widely used; however, achieving fair Wake-up Word detection across diverse speaker populations remains a critical challenge due to persistent demographic biases. This study evaluates the effectiveness of demographics-agnostic training techniques in mitigating performance disparities among speakers of varying sex, age, and accent. We utilize the OK Aura database for our experiments, employing a training methodology that excludes demographic labels, which are reserved for evaluation purposes. We explore (i) data augmentation techniques to enhance model generalization and (ii) knowledge distillation of pre-trained foundational speech models. The experimental results indicate that these demographics-agnostic training techniques markedly reduce demographic bias, leading to a more equitable performance profile across different speaker groups. Specifically, one of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.