Efficient coding along the visual hierarchy

Ananya Passi; Brian S. Robinson; and Michael F. Bonner

arXiv:2605.19155·cs.CV·May 20, 2026

Efficient coding along the visual hierarchy

Ananya Passi, Brian S. Robinson, and Michael F. Bonner

PDF

TL;DR

This paper demonstrates that an unsupervised efficient coding approach can develop human-aligned visual features from limited data, aligning with neural responses and improving low-data learning.

Contribution

It introduces a hierarchical unsupervised learning method based on efficient coding that captures natural image statistics without labels or backpropagation.

Findings

01

Features progress from edges to textures and shapes.

02

Features are recognizable by humans and predict fMRI responses.

03

Hybrid training improves brain alignment and category learning in low-data scenarios.

Abstract

Biological visual systems learn from limited experience, unlike deep learning models that rely on millions of training images. What learning principles make this possible? We tested whether efficient coding, the idea that neural representations capture the statistical structure of natural inputs, can build a hierarchy of human-aligned visual features from limited data. We developed an unsupervised learning procedure in which each layer of a deep network compresses its inputs onto the dominant modes of variation in natural images, using only local statistics and no labels, tasks, or backpropagation. This unsupervised procedure yields features that progress from edges and colors to textures and shapes. The features of this deep efficient coding model are readily recognized by human observers and are predictive of image-evoked fMRI responses in human visual cortex. Furthermore, a hybrid…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.