NeuroSketch: An Effective Framework for Neural Decoding via Systematic Architectural Optimization
Gaorui Zhang, Zhizhang Yuan, Jialan Yang, Junru Chen, Li Meng, Yang Yang

TL;DR
NeuroSketch is a systematic architecture optimization framework that significantly improves neural decoding performance across multiple modalities and tasks, setting new state-of-the-art results.
Contribution
This study introduces NeuroSketch, a novel framework for neural decoding that systematically optimizes model architectures, demonstrating its effectiveness across diverse datasets and modalities.
Findings
CNN-2D outperforms other architectures in neural decoding
Performance improves with macro- to micro-level architecture optimization
NeuroSketch achieves state-of-the-art results across all evaluated datasets
Abstract
Neural decoding, a critical component of Brain-Computer Interface (BCI), has recently attracted increasing research interest. Previous research has focused on leveraging signal processing and deep learning methods to enhance neural decoding performance. However, the in-depth exploration of model architectures remains underexplored, despite its proven effectiveness in other tasks such as energy forecasting and image classification. In this study, we propose NeuroSketch, an effective framework for neural decoding via systematic architecture optimization. Starting with the basic architecture study, we find that CNN-2D outperforms other architectures in neural decoding tasks and explore its effectiveness from temporal and spatial perspectives. Building on this, we optimize the architecture from macro- to micro-level, achieving improvements in performance at each step. The exploration…
Peer Reviews
Decision·Submitted to ICLR 2026
1. Systematic Architectural Exploration with a Clear Optimization Path: The paper begins with a comparison of basic architectures (CNN-1D/2D, GRU, Transformer, etc.) and progressively delves into macro (latent space transformation) and micro (convolution operation optimization) levels of design. This forms a complete and logically rigorous optimization roadmap, offering high interpretability and methodological value. 2. Large-Scale, Multi-Modal Experimental Validation: Extensive validation was
1. Lack of Discussion on Neurophysiological Interpretability: Although the model performs excellently, the paper does not deeply analyze whether the neural representations learned by NeuroSketch are interpretable from a neuroscience perspective (e.g., correspondence to brain region activation or cognitive processes). This is an important dimension in BCI research. 2. Insufficient Comparison with Some Existing Neural Decoding-Specific Models: While comparisons are made against several general ti
The paper is well-structured and it follows a nice step-by-step analysis on how specific details were implemented in the final architecture. It’s the first time I see an analysis like this one. Most papers just introduce a new architecture without any design justifications.
The paper fails to provide more comparisons with better models (deep learning and foundation models). In addition, although thorough the analysis does not provide interpretable insights behind the choices. Writing: Paper is well-written and good structured. Overall: The paper shows some merits but it would be vital to have my questions answered.
1. This study presents a comprehensive evaluation, validating the proposed model across eight datasets and three distinct recording modalities: EEG, sEEG, and ECoG. 2. After extensive tuning of model hyperparameters, the model surpasses other models. 3. The authors provide open-source reproducibility commitments and dataset transparency.
1. The implementation details of baselines are missing. The performance of Conformer on Du-IN in Table 4 significantly underperforms the reported value in the Du-IN paper. We need specific details of the baseline implementation to ensure the reliability of the conclusions of the overall work and fair comparison. 2. The CNN-2D introduces translation invariance, which seems to be uncommon in brain signal modeling. It would be useful to include more analysis about the features obtained by the mode
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEEG and Brain-Computer Interfaces · Neural dynamics and brain function · Emotion and Mood Recognition
