How does the primate brain combine generative and discriminative computations in vision?
Benjamin Peters, James J. DiCarlo, Todd Gureckis, Ralf Haefner, Leyla, Isik, Joshua Tenenbaum, Talia Konkle, Thomas Naselaris, Kimberly Stachenfeld,, Zenna Tavares, Doris Tsao, Ilker Yildirim, Nikolaus Kriegeskorte

TL;DR
This paper explores how the primate brain might integrate both feedforward discriminative and feedback generative processes in vision, aiming to unify two contrasting theories through empirical and theoretical analysis.
Contribution
It clarifies the terminology, reviews key evidence, and proposes a research program to uncover the hybrid algorithm in primate visual processing.
Findings
Review of empirical evidence supporting both theories
Proposal of a unified framework for vision processing
Identification of research directions to test hybrid models
Abstract
Vision is widely understood as an inference problem. However, two contrasting conceptions of the inference process have each been influential in research on biological vision as well as the engineering of machine vision. The first emphasizes bottom-up signal flow, describing vision as a largely feedforward, discriminative inference process that filters and transforms the visual information to remove irrelevant variation and represent behaviorally relevant information in a format suitable for downstream functions of cognition and behavioral control. In this conception, vision is driven by the sensory data, and perception is direct because the processing proceeds from the data to the latent variables of interest. The notion of "inference" in this conception is that of the engineering literature on neural networks, where feedforward convolutional neural networks processing images are said…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural dynamics and brain function · Face Recognition and Perception · Visual perception and processing mechanisms
