Improving Unsupervised Task-driven Models of Ventral Visual Stream via Relative Position Predictivity

Dazhong Rong; Hao Dong; Xing Gao; Jiyu Wei; Di Hong; Yaoyao Hao; Qinming He; Yueming Wang

arXiv:2505.08316 · cs.CE · November 11, 2025

TL;DR

This paper introduces an unsupervised learning method that combines contrastive learning with relative position (RP) prediction to model the ventral visual stream (VVS), improving downstream object recognition performance, RP predictivity, and brain similarity.

Contribution

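It integrates relative position (RP) prediction with contrastive learning, motivated by a theoretical argument that contrastive learning alone may not give a model RP prediction capability, so that the resulting model aligns more closely with the biological functions of the ventral visual stream (VVS).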

Findings

01 Enhanced downstream object recognition performance

02 Improved relative position predictivity

03 Increased brain similarity of models

Abstract

Based on the concept that the ventral visual stream (VVS) mainly functions for object recognition, current unsupervised task-driven methods model the VVS by contrastive learning and have achieved good brain similarity. However, we believe the functions of the VVS extend beyond object recognition. In this paper, we introduce an additional function involving the VVS, named relative position (RP) prediction. We first explain theoretically that contrastive learning may be unable to give a model the capability of RP prediction. Motivated by this, we integrate RP learning with contrastive learning and propose a new unsupervised task-driven method to model the VVS that is more in line with biological reality. We conduct extensive experiments, demonstrating that: (i) our method significantly improves downstream performance of object recognition while enhancing RP predictivity; (ii) RP predictivity…
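The idea of integrating RP learning with contrastive learning can be illustrated with a joint objective. The following is a minimal sketch, not the authors' formulation: it assumes an InfoNCE-style contrastive term, assumes RP prediction is framed as classification over discrete relative-position classes, and combines the two with a hypothetical weight `rp_weight`. All function and parameter names here are illustrative assumptions; the paper and the rdz98/unsup-vvs repository define the actual losses.

```python
import math


def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def normalize(v):
    """Scale a vector to unit L2 norm (zero vectors are returned unchanged)."""
    norm = math.sqrt(dot(v, v)) or 1.0
    return [x / norm for x in v]


def log_softmax(logits):
    """Numerically stable log-softmax over a list of scores."""
    m = max(logits)
    lse = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - lse for x in logits]


def info_nce(anchors, positives, temperature=0.1):
    """InfoNCE-style contrastive loss (assumed form).

    anchors[i] and positives[i] are embeddings of two views of the same
    image; every positives[j] with j != i serves as a negative for anchors[i].
    """
    a = [normalize(x) for x in anchors]
    p = [normalize(x) for x in positives]
    total = 0.0
    for i in range(len(a)):
        logits = [dot(a[i], p[j]) / temperature for j in range(len(p))]
        total += -log_softmax(logits)[i]
    return total / len(a)


def rp_cross_entropy(rp_logits, rp_labels):
    """Cross-entropy for RP prediction (assumed classification framing).

    rp_logits[i] holds scores over discrete relative-position classes for
    one pair of image regions; rp_labels[i] is the true class index.
    """
    total = sum(-log_softmax(l)[y] for l, y in zip(rp_logits, rp_labels))
    return total / len(rp_labels)


def combined_loss(anchors, positives, rp_logits, rp_labels,
                  rp_weight=1.0, temperature=0.1):
    """Weighted sum of the two terms; rp_weight is a hypothetical knob."""
    return (info_nce(anchors, positives, temperature)
            + rp_weight * rp_cross_entropy(rp_logits, rp_labels))
```

Setting `rp_weight=0.0` recovers a purely contrastive objective, which makes it easy to compare against a contrastive-only baseline in this toy setting.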

Code & Models

Repositories

rdz98/unsup-vvs (PyTorch, official)

Taxonomy

Methods: Contrastive Learning