CA-TCN: A Causal-Anticausal Temporal Convolutional Network for Direct Auditory Attention Decoding

I\~nigo Garc\'ia-Ugarte; Rub\'en Eguinoa; Ricardo San Mart\'in; Daniel Paternain; Carmen Vidaurre

arXiv:2603.26394·cs.SD·March 30, 2026

CA-TCN: A Causal-Anticausal Temporal Convolutional Network for Direct Auditory Attention Decoding

I\~nigo Garc\'ia-Ugarte, Rub\'en Eguinoa, Ricardo San Mart\'in, Daniel Paternain, Carmen Vidaurre

PDF

TL;DR

The paper introduces CA-TCN, a novel neural network architecture that improves auditory attention decoding accuracy by explicitly modeling neural and stimulus responses with causal and anticausal convolutions, advancing real-world auditory attention applications.

Contribution

It presents the CA-TCN model, integrating causal and anticausal convolutions for improved neural decoding, outperforming existing models in accuracy and robustness.

Findings

01

CA-TCN outperforms baseline models with 0.5%-3.2% accuracy gains.

02

Statistically significant improvements in 4 of 6 settings.

03

Model demonstrates spatial robustness across datasets.

Abstract

A promising approach for steering auditory attention in complex listening environments relies on Auditory Attention Decoding (AAD), which aim to identify the attended speech stream in a multiple speaker scenario from neural recordings. Entrainment-based AAD approaches, typically assume access to clean speech sources and electroencephalography (EEG) signals to exploit low-frequency correlations between the neural response and the attended stimulus. In this study, we propose CA-TCN, a Causal-Anticausal Temporal Convolutional Network that directly classifies the attended speaker. The proposed architecture integrates several best practices from convolutional neural networks in sequence processing tasks. Importantly, it explicitly aligns auditory stimuli and neural responses by employing separate causal and anticausal convolutions respectively, with distinct receptive fields operating in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.