Task-Driven Convolutional Recurrent Models of the Visual System

Aran Nayebi; Daniel Bear; Jonas Kubilius; Kohitij Kar; Surya Ganguli,; David Sussillo; James J. DiCarlo; Daniel L. K. Yamins

arXiv:1807.00053·q-bio.NC·October 30, 2018·121 cites

Task-Driven Convolutional Recurrent Models of the Visual System

Aran Nayebi, Daniel Bear, Jonas Kubilius, Kohitij Kar, Surya Ganguli,, David Sussillo, James J. DiCarlo, Daniel L. K. Yamins

PDF

Open Access 1 Repo

TL;DR

This paper investigates how recurrence in convolutional neural networks can improve object recognition and better emulate neural activity in the primate visual system, introducing novel recurrent cell structures and feedback mechanisms.

Contribution

It introduces new recurrent cell designs with bypassing and gating, and demonstrates their effectiveness in enhancing CNN performance and neural plausibility.

Findings

01

Recurrent CNNs with novel cells outperform standard CNNs on ImageNet.

02

Task-optimized recurrent models better match primate neural dynamics.

03

Long-range feedback and local recurrence improve object recognition.

Abstract

Feed-forward convolutional neural networks (CNNs) are currently state-of-the-art for object classification tasks such as ImageNet. Further, they are quantitatively accurate models of temporally-averaged responses of neurons in the primate brain's visual system. However, biological visual systems have two ubiquitous architectural features not shared with typical CNNs: local recurrence within cortical areas, and long-range feedback from downstream areas to upstream areas. Here we explored the role of recurrence in improving classification performance. We found that standard forms of recurrence (vanilla RNNs and LSTMs) do not perform well within deep CNNs on the ImageNet task. In contrast, novel cells that incorporated two structural features, bypassing and gating, were able to boost task accuracy substantially. We extended these design principles in an automated search over thousands of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

neuroailab/tnn
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural dynamics and brain function · Visual perception and processing mechanisms · Face Recognition and Perception