Emergence of foveal image sampling from learning to attend in visual   scenes

Brian Cheung; Eric Weiss; Bruno Olshausen

arXiv:1611.09430·cs.NE·October 24, 2017·20 cites

Emergence of foveal image sampling from learning to attend in visual scenes

Brian Cheung, Eric Weiss, Bruno Olshausen

PDF

Open Access

TL;DR

This paper introduces a neural attention model with a learnable retinal sampling lattice that, after training on a visual search task, naturally develops a fovea-like high-resolution center and low-resolution periphery, mirroring primate retinal structure.

Contribution

It demonstrates how a neural attention model can spontaneously develop a foveal sampling pattern similar to biological retinas through learning.

Findings

01

The model's retinal lattice resembles the primate retina's eccentricity-dependent sampling.

02

Emergent properties can be amplified or suppressed by changing training conditions.

03

The model effectively performs visual search with minimal fixations.

Abstract

We describe a neural attention model with a learnable retinal sampling lattice. The model is trained on a visual search task requiring the classification of an object embedded in a visual scene amidst background distractors using the smallest number of fixations. We explore the tiling properties that emerge in the model's retinal sampling lattice after training. Specifically, we show that this lattice resembles the eccentricity dependent sampling lattice of the primate retina, with a high resolution region in the fovea surrounded by a low resolution periphery. Furthermore, we find conditions where these emergent properties are amplified or eliminated providing clues to their function.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual Attention and Saliency Detection · Advanced Image and Video Retrieval Techniques · Image Enhancement Techniques