Pixels to Voxels: Modeling Visual Representation in the Human Brain
Pulkit Agrawal, Dustin Stansbury, Jitendra Malik, Jack L. Gallant

TL;DR
This paper introduces models based on Fisher Vectors and Convolutional Neural Networks that can predict human brain activity directly from raw pixel input, bypassing the need for semantic annotations.
Contribution
Developed the first models that predict high-level visual brain activity directly from pixels using computer vision and machine learning techniques.
Findings
Both FV and ConvNet models accurately predict brain activity in high-level visual areas.
Models operate directly on pixel data without semantic tags or annotations.
This approach offers a new platform for understanding human visual processing.
Abstract
The human brain is adept at solving difficult high-level visual processing problems such as image interpretation and object recognition in natural scenes. Over the past few years neuroscientists have made remarkable progress in understanding how the human brain represents categories of objects and actions in natural scenes. However, all current models of high-level human vision operate on hand annotated images in which the objects and actions have been assigned semantic tags by a human operator. No current models can account for high-level visual function directly in terms of low-level visual input (i.e., pixels). To overcome this fundamental limitation we sought to develop a new class of models that can predict human brain activity directly from low-level visual input (i.e., pixels). We explored two classes of models drawn from computer vision and machine learning. The first class of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Face Recognition and Perception · Advanced Neural Network Applications
