EEG-MFTNet: An Enhanced EEGNet Architecture with Multi-Scale Temporal Convolutions and Transformer Fusion for Cross-Session Motor Imagery Decoding
Panagiotis Andrikopoulos, Siamak Mehrkanoon

TL;DR
EEG-MFTNet is a deep learning model that enhances EEG-based motor imagery decoding by integrating multi-scale temporal convolutions and Transformer fusion, showing improved accuracy on cross-session EEG data.
Contribution
The paper introduces EEG-MFTNet, a novel architecture combining multi-scale convolutions and Transformer encoding to improve cross-session EEG motor imagery classification.
Findings
Achieves 58.9% accuracy on SHU dataset for MI decoding.
Outperforms baseline models like EEGNet in cross-session tests.
Maintains low computational complexity suitable for real-time BCI.
Abstract
Brain-computer interfaces (BCIs) enable direct communication between the brain and external devices, providing critical support for individuals with motor impairments. However, accurate motor imagery (MI) decoding from electroencephalography (EEG) remains challenging due to noise and cross-session variability. This study introduces EEG-MFTNet, a novel deep learning model based on the EEGNet architecture, enhanced with multi-scale temporal convolutions and a Transformer encoder stream. These components are designed to capture both short and long-range temporal dependencies in EEG signals. The model is evaluated on the SHU dataset using a subject-dependent cross-session setup, outperforming baseline models, including EEGNet and its recent derivatives. EEG-MFTNet achieves an average classification accuracy of 58.9% while maintaining low computational complexity and inference latency. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
