Neural Attentive Circuits
Nasim Rahaman, Martin Weiss, Francesco Locatello, Chris Pal, and Yoshua Bengio, Bernhard Sch\"olkopf, Li Erran Li, Nicolas Ballas

TL;DR
Neural Attentive Circuits (NACs) are a modular neural architecture that jointly learns module configurations and connectivity, improving low-shot adaptation, out-of-distribution robustness, and inference speed across diverse data modalities.
Contribution
This work introduces NACs, a novel general purpose neural architecture that learns module parameterization and sparse connectivity end-to-end without domain knowledge.
Findings
NACs improve low-shot adaptation by about 10% on CIFAR and CUBs.
NACs enhance out-of-distribution robustness by approximately 2.5% on Tiny ImageNet-R.
NACs achieve up to 8x inference speedup with minimal performance loss.
Abstract
Recent work has seen the development of general purpose neural architectures that can be trained to perform tasks across diverse data modalities. General purpose models typically make few assumptions about the underlying data-structure and are known to perform well in the large-data regime. At the same time, there has been growing interest in modular neural architectures that represent the data using sparsely interacting modules. These models can be more robust out-of-distribution, computationally efficient, and capable of sample-efficient adaptation to new data. However, they tend to make domain-specific assumptions about the data, and present challenges in how module behavior (i.e., parameterization) and connectivity (i.e., their layout) can be jointly learned. In this work, we introduce a general purpose, yet modular neural architecture called Neural Attentive Circuits (NACs) that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsFerroelectric and Negative Capacitance Devices · Advanced Memory and Neural Computing · Machine Learning and ELM
