Task-Driven Convolutional Recurrent Models of the Visual System
Aran Nayebi, Daniel Bear, Jonas Kubilius, Kohitij Kar, Surya Ganguli,, David Sussillo, James J. DiCarlo, Daniel L. K. Yamins

TL;DR
This paper investigates how recurrence in convolutional neural networks can improve object recognition and better emulate neural activity in the primate visual system, introducing novel recurrent cell structures and feedback mechanisms.
Contribution
It introduces new recurrent cell designs with bypassing and gating, and demonstrates their effectiveness in enhancing CNN performance and neural plausibility.
Findings
Recurrent CNNs with novel cells outperform standard CNNs on ImageNet.
Task-optimized recurrent models better match primate neural dynamics.
Long-range feedback and local recurrence improve object recognition.
Abstract
Feed-forward convolutional neural networks (CNNs) are currently state-of-the-art for object classification tasks such as ImageNet. Further, they are quantitatively accurate models of temporally-averaged responses of neurons in the primate brain's visual system. However, biological visual systems have two ubiquitous architectural features not shared with typical CNNs: local recurrence within cortical areas, and long-range feedback from downstream areas to upstream areas. Here we explored the role of recurrence in improving classification performance. We found that standard forms of recurrence (vanilla RNNs and LSTMs) do not perform well within deep CNNs on the ImageNet task. In contrast, novel cells that incorporated two structural features, bypassing and gating, were able to boost task accuracy substantially. We extended these design principles in an automated search over thousands of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural dynamics and brain function · Visual perception and processing mechanisms · Face Recognition and Perception
