Elementary, My Dear Watson: Non-Invasive Neural Keyword Spotting in the LibriBrain Dataset

Gereon Elvers; Gilad Landau; Oiwi Parker Jones

arXiv:2510.21038·cs.LG·October 31, 2025

Elementary, My Dear Watson: Non-Invasive Neural Keyword Spotting in the LibriBrain Dataset

Gereon Elvers, Gilad Landau, Oiwi Parker Jones

PDF

TL;DR

This paper introduces a non-invasive brain-computer interface benchmark for keyword spotting using the LibriBrain dataset, providing standardized evaluation protocols, a baseline model, and insights into factors affecting detection performance.

Contribution

It proposes a new benchmark task for brain-to-text communication, with standardized data splits, evaluation metrics, and a baseline model, advancing research in practical neural keyword detection.

Findings

01

Baseline model achieves 13x higher AUPRC than permutation baseline.

02

Performance improves log-linearly with more training hours.

03

Word frequency and duration systematically affect detectability.

Abstract

Non-invasive brain-computer interfaces (BCIs) are beginning to benefit from large, public benchmarks. However, current benchmarks target relatively simple, foundational tasks like Speech Detection and Phoneme Classification, while application-ready results on tasks like Brain-to-Text remain elusive. We propose Keyword Spotting (KWS) as a practically applicable, privacy-aware intermediate task. Using the deep 52-hour, within-subject LibriBrain corpus, we provide standardized train/validation/test splits for reproducible benchmarking, and adopt an evaluation protocol tailored to extreme class imbalance. Concretely, we use area under the precision-recall curve (AUPRC) as a robust evaluation metric, complemented by false alarms per hour (FA/h) at fixed recall to capture user-facing trade-offs. To simplify deployment and further experimentation within the research community, we are releasing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.