CA-TCN: A Causal-Anticausal Temporal Convolutional Network for Direct Auditory Attention Decoding
I\~nigo Garc\'ia-Ugarte, Rub\'en Eguinoa, Ricardo San Mart\'in, Daniel Paternain, Carmen Vidaurre

TL;DR
The paper introduces CA-TCN, a novel neural network architecture that improves auditory attention decoding accuracy by explicitly modeling neural and stimulus responses with causal and anticausal convolutions, advancing real-world auditory attention applications.
Contribution
It presents the CA-TCN model, integrating causal and anticausal convolutions for improved neural decoding, outperforming existing models in accuracy and robustness.
Findings
CA-TCN outperforms baseline models with 0.5%-3.2% accuracy gains.
Statistically significant improvements in 4 of 6 settings.
Model demonstrates spatial robustness across datasets.
Abstract
A promising approach for steering auditory attention in complex listening environments relies on Auditory Attention Decoding (AAD), which aim to identify the attended speech stream in a multiple speaker scenario from neural recordings. Entrainment-based AAD approaches, typically assume access to clean speech sources and electroencephalography (EEG) signals to exploit low-frequency correlations between the neural response and the attended stimulus. In this study, we propose CA-TCN, a Causal-Anticausal Temporal Convolutional Network that directly classifies the attended speaker. The proposed architecture integrates several best practices from convolutional neural networks in sequence processing tasks. Importantly, it explicitly aligns auditory stimuli and neural responses by employing separate causal and anticausal convolutions respectively, with distinct receptive fields operating in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
