SIMON: Saliency-aware Integrative Multi-view Object-centric Neural Decoding

YuSheng Lin; Ji-Hwa Tsai; Chun-Shu Wei

arXiv:2605.00401·cs.CV·May 4, 2026

SIMON: Saliency-aware Integrative Multi-view Object-centric Neural Decoding

YuSheng Lin, Ji-Hwa Tsai, Chun-Shu Wei

PDF

1 Repo

TL;DR

SIMON introduces a saliency-aware multi-view framework for zero-shot EEG-to-image retrieval, emphasizing informative object regions to improve accuracy over existing methods.

Contribution

It proposes a novel saliency-aware sampling approach that enhances EEG-to-image retrieval by integrating multi-view visual features aligned with human attention.

Findings

01

Achieves state-of-the-art Top-1 accuracy of 69.7% intra-subject.

02

Outperforms recent baselines in inter-subject retrieval.

03

Demonstrates robustness across sampling granularity and encoder backbones.

Abstract

Recent EEG-to-image retrieval methods leverage pretrained vision encoders and foveation-inspired priors, but typically assume a fixed, center-focused view. This center bias conflicts with content-driven human attention, creating a geometric-semantic dissociation between visual features and EEG responses. We propose SIMON, a saliency-aware multi-view framework for zero-shot EEG-to-image retrieval. SIMON combines foreground segmentation and saliency prediction to select fixation centers via Saliency-Aware Sampling (SAS), then generates foveated views that emphasize informative object regions while suppressing background clutter. On THINGS-EEG, SIMON achieves state-of-the-art performance in both intra-subject and inter-subject settings, reaching an average Top-1 accuracy of 69.7% and 19.6%, respectively, consistently outperforming recent competitive baselines. Analyses across sampling…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

simonlink666/SIMON
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.