Energy-Efficient Visual Search by Eye Movement and Low-Latency Spiking   Neural Network

Yunhui Zhou; Dongqi Han; Yuguo Yu

arXiv:2310.06578·cs.NE·October 11, 2023

Energy-Efficient Visual Search by Eye Movement and Low-Latency Spiking Neural Network

Yunhui Zhou, Dongqi Han, Yuguo Yu

PDF

Open Access

TL;DR

This paper introduces a novel SNN-based visual search model inspired by human vision, combining eye movement strategies and energy-efficient processing, outperforming humans in speed and accuracy while providing insights into human search behavior.

Contribution

The first SNN-based visual search model integrating retina-like processing, eye movement, and population coding, demonstrating human-like and near-optimal strategies with superior performance.

Findings

01

Model learns human-like or near-optimal fixation strategies

02

Outperforms humans in search speed and accuracy

03

Achieves high energy efficiency with sparse activation

Abstract

Human vision incorporates non-uniform resolution retina, efficient eye movement strategy, and spiking neural network (SNN) to balance the requirements in visual field size, visual resolution, energy cost, and inference latency. These properties have inspired interest in developing human-like computer vision. However, existing models haven't fully incorporated the three features of human vision, and their learned eye movement strategies haven't been compared with human's strategy, making the models' behavior difficult to interpret. Here, we carry out experiments to examine human visual search behaviors and establish the first SNN-based visual search model. The model combines an artificial retina with spiking feature extraction, memory, and saccade decision modules, and it employs population coding for fast and efficient saccade decisions. The model can learn either a human-like or a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCCD and CMOS Imaging Sensors · Advanced Memory and Neural Computing · Retinal Development and Disorders

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings