FOVI: A biologically-inspired foveated interface for deep vision models
Nicholas M. Blauch, George A. Alvarez, Talia Konkle

TL;DR
This paper introduces FOVI, a biologically-inspired foveated interface for deep vision models that improves efficiency by mimicking human eye movement and resolution variation, enabling scalable high-resolution vision processing.
Contribution
The paper presents a novel foveated vision interface (FOVI) that reformats variable-resolution sensor data into a dense, V1-like manifold, enabling efficient convolutional architectures and adaptation of existing models.
Findings
FOVI achieves competitive performance with reduced computational cost.
The approach enables scalable high-resolution egocentric vision.
Code and models are publicly available.
Abstract
Human vision is foveated, with variable resolution peaking at the center of a large field of view; this reflects an efficient trade-off for active sensing, allowing eye-movements to bring different parts of the world into focus with other parts of the world in context. In contrast, most computer vision systems encode the visual world at a uniform resolution, raising challenges for processing full-field high-resolution images efficiently. We propose a foveated vision interface (FOVI) based on the human retina and primary visual cortex, that reformats a variable-resolution retina-like sensor array into a uniformly dense, V1-like sensor manifold. Receptive fields are defined as k-nearest-neighborhoods (kNNs) on the sensor manifold, enabling kNN-convolution via a novel kernel mapping technique. We demonstrate two use cases: (1) an end-to-end kNN-convolutional architecture, and (2) a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · CCD and CMOS Imaging Sensors · Advanced Memory and Neural Computing
