Fusing Foveal Fixations Using Linear Retinal Transformations and Bayesian Experimental Design

Christopher K. I. Williams

arXiv:2505.01249·cs.CV·November 20, 2025

Fusing Foveal Fixations Using Linear Retinal Transformations and Bayesian Experimental Design

Christopher K. I. Williams

PDF

Open Access

TL;DR

This paper introduces a model that fuses multiple scene fixations into a unified representation by explicitly modeling retinal transformations and optimizing gaze decisions through Bayesian experimental design, demonstrated on image datasets.

Contribution

It presents a novel approach combining linear retinal transformations with Bayesian experimental design for scene fusion, enabling exact inference and optimized gaze planning.

Findings

01

Effective scene fusion demonstrated on Frey faces and MNIST datasets.

02

Exact inference achieved through linear retinal transformation modeling.

03

Bayesian experimental design improves gaze selection for scene understanding.

Abstract

Humans (and many vertebrates) face the problem of fusing together multiple fixations of a scene in order to obtain a representation of the whole, where each fixation uses a high-resolution fovea and decreasing resolution in the periphery. In this paper we explicitly represent the retinal transformation of a fixation as a linear downsampling of a high-resolution latent image of the scene, exploiting the known geometry. This linear transformation allows us to carry out exact inference for the latent variables in factor analysis (FA) and mixtures of FA models of the scene. Further, this allows us to formulate and solve the choice of "where to look next" as a Bayesian experimental design problem using the Expected Information Gain criterion. Experiments on the Frey faces and MNIST datasets demonstrate the effectiveness of our models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual perception and processing mechanisms · Ophthalmology and Visual Impairment Studies

MethodsFeedback Alignment