Elementary, My Dear Watson: Non-Invasive Neural Keyword Spotting in the LibriBrain Dataset
Gereon Elvers, Gilad Landau, Oiwi Parker Jones

TL;DR
This paper introduces a non-invasive brain-computer interface benchmark for keyword spotting using the LibriBrain dataset, providing standardized evaluation protocols, a baseline model, and insights into factors affecting detection performance.
Contribution
It proposes a new benchmark task for brain-to-text communication, with standardized data splits, evaluation metrics, and a baseline model, advancing research in practical neural keyword detection.
Findings
Baseline model achieves 13x higher AUPRC than permutation baseline.
Performance improves log-linearly with more training hours.
Word frequency and duration systematically affect detectability.
Abstract
Non-invasive brain-computer interfaces (BCIs) are beginning to benefit from large, public benchmarks. However, current benchmarks target relatively simple, foundational tasks like Speech Detection and Phoneme Classification, while application-ready results on tasks like Brain-to-Text remain elusive. We propose Keyword Spotting (KWS) as a practically applicable, privacy-aware intermediate task. Using the deep 52-hour, within-subject LibriBrain corpus, we provide standardized train/validation/test splits for reproducible benchmarking, and adopt an evaluation protocol tailored to extreme class imbalance. Concretely, we use area under the precision-recall curve (AUPRC) as a robust evaluation metric, complemented by false alarms per hour (FA/h) at fixed recall to capture user-facing trade-offs. To simplify deployment and further experimentation within the research community, we are releasing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
