CIPHER: Conformer-based Inference of Phonemes from High-density EEG
Varshith Madishetty

TL;DR
This paper introduces CIPHER, a dual-pathway model using ERP and broadband DDA features to decode phonemes from high-density EEG, providing a benchmark for EEG-based speech decoding.
Contribution
The work presents a novel Conformer-based model and a comprehensive benchmark for phoneme decoding from EEG, highlighting limitations and confound issues.
Findings
High performance on binary articulatory tasks but vulnerable to confounds.
Limited fine-grained discriminability in 11-class phoneme task.
Positioned as a benchmark rather than an EEG-to-text system.
Abstract
Decoding speech information from scalp EEG remains difficult due to low SNR and spatial blurring. We present CIPHER (Conformer-based Inference of Phonemes from High-density EEG Representations), a dual-pathway model using (i) ERP features and (ii) broadband DDA coefficients. On OpenNeuro ds006104 (24 participants, two studies with concurrent TMS), binary articulatory tasks reach near-ceiling performance but are highly confound-vulnerable (acoustic onset separability and TMS-target blocking). On the primary 11-class CVC phoneme task under full Study 2 LOSO (16 held-out subjects), performance is substantially lower (real-word WER: ERP 0.671 +/- 0.080, DDA 0.688 +/- 0.096, indicating limited fine-grained discriminability. We therefore position this work as a benchmark and feature-comparison study rather than an EEG-to-text system, and we constrain neural-representation claims to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
