# TSFNet: Temporal-Spatial Fusion Network for Hybrid Brain-Computer Interface

**Authors:** Yan Zhang, Bo Yin, Xiaoyang Yuan

PMC · DOI: 10.3390/s25196111 · 2025-10-03

## TL;DR

This paper introduces TSFNet, a new deep learning network that improves brain-computer interfaces by combining EEG and fNIRS signals more effectively.

## Contribution

The novel Temporal-Spatial Fusion Network (TSFNet) with EFGF and CAFÉ layers enables deep fusion of EEG and fNIRS signals for hybrid BCIs.

## Key findings

- TSFNet achieved 70.18% accuracy for motor imagery classification.
- It outperformed existing methods with 86.26% accuracy for mental arithmetic.
- The model demonstrated 81.13% accuracy for word generation tasks.

## Abstract

Unimodal brain–computer interfaces (BCIs) often suffer from inherent limitations due to the characteristic of using single modalities. While hybrid BCIs combining electroencephalography (EEG) and functional near-infrared spectroscopy (fNIRS) offer complementary advantages, effectively integrating their spatiotemporal features remains a challenge due to inherent signal asynchrony. This study aims to develop a novel deep fusion network to achieve synergistic integration of EEG and fNIRS signals for improved classification performance across different tasks. We propose a novel Temporal-Spatial Fusion Network (TSFNet), which consists of two key sublayers: the EEG-fNIRS-guided Fusion (EFGF) layer and the Cross-Attention-based Feature Enhancement (CAFÉ) layer. The EFGF layer extracts temporal features from EEG and spatial features from fNIRS to generate a hybrid attention map, which is utilized to achieve more effective and complementary integration of spatiotemporal information. The CAFÉ layer enables bidirectional interaction between fNIRS and fusion features via a cross-attention mechanism, which enhances the fusion features and selectively filters informative fNIRS representations. Through the two sublayers, TSFNet achieves deep fusion of multimodal features. Finally, TSFNet is evaluated on motor imagery (MI), mental arithmetic (MA), and word generation (WG) classification tasks. Experimental results demonstrate that TSFNet achieves superior classification performance, with average accuracies of 70.18% for MI, 86.26% for MA, and 81.13% for WG, outperforming existing state-of-the-art multimodal algorithms. These findings suggest that TSFNet provides an effective solution for spatiotemporal feature fusion in hybrid BCIs, with potential applications in real-world BCI systems.

## Full-text entities

- **Diseases:** injury to (MESH:D014947), MI (MESH:D000068079)
- **Chemicals:** Lcafe (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12526594/full.md

---
Source: https://tomesphere.com/paper/PMC12526594