Visual Fixation-Based Retinal Prosthetic Simulation
Yuli Wu, Do Dinh Tan Nguyen, Henning Konermann, R\"uveyda Yilmaz, Peter Walter, Johannes Stegmaier

TL;DR
This paper introduces a fixation-driven retinal prosthetic simulation framework that uses self-attention and end-to-end optimization to improve visual percepts and classification accuracy, outperforming traditional downsampling methods.
Contribution
The study presents a novel fixation-based simulation framework that integrates self-attention, trainable encoding, and percept prediction to enhance retinal prosthetic performance.
Findings
Achieved 87.72% classification accuracy on ImageNet subset.
Outperformed downsampling-based accuracy of 40.59%.
Approached healthy upper bound of 92.76%.
Abstract
This study proposes a retinal prosthetic simulation framework driven by visual fixations, inspired by the saccade mechanism, and assesses performance improvements through end-to-end optimization in a classification task. Salient patches are predicted from input images using the self-attention map of a vision transformer to mimic visual fixations. These patches are then encoded by a trainable U-Net and simulated using the pulse2percept framework to predict visual percepts. By incorporating a learnable encoder, we aim to optimize the visual information transmitted to the retinal implant, addressing both the limited resolution of the electrode array and the distortion between the input stimuli and resulting phosphenes. The predicted percepts are evaluated using the self-supervised DINOv2 foundation model, with an optional learnable linear layer for classification accuracy. On a subset of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeuroscience and Neural Engineering · Tactile and Sensory Interactions · EEG and Brain-Computer Interfaces
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Attention Is All You Need · Softmax · Multi-Head Attention · Dense Connections · Concatenated Skip Connection · Residual Connection · Layer Normalization · Vision Transformer · Linear Layer
