Fast & Slow Learning: Incorporating Synthetic Gradients in Neural Memory Controllers
Tharindu Fernando, Simon Denman, Sridha Sridharan, Clinton Fookes

TL;DR
This paper introduces a novel neural memory network controller that incorporates synthetic gradients inspired by human neuromodulation, enabling rapid adaptation and improved performance in meta-learning tasks.
Contribution
It proposes a decoupled learning approach for NMN controllers using synthetic gradients, enhancing flexibility and quick adaptation in memory-based meta-learning scenarios.
Findings
Outperforms current state-of-the-art methods on multiple benchmarks.
Enables rapid learning and adaptation to new information.
Facilitates knowledge sharing among multiple networks.
Abstract
Neural Memory Networks (NMNs) have received increased attention in recent years compared to deep architectures that use a constrained memory. Despite their new appeal, the success of NMNs hinges on the ability of the gradient-based optimiser to perform incremental training of the NMN controllers, determining how to leverage their high capacity for knowledge retrieval. This means that while excellent performance can be achieved when the training data is consistent and well distributed, rare data samples are hard to learn from as the controllers fail to incorporate them effectively during model training. Drawing inspiration from the human cognition process, in particular the utilisation of neuromodulators in the human brain, we propose to decouple the learning process of the NMN controllers to allow them to achieve flexible, rapid adaptation in the presence of new information. This trait…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and ELM · Stochastic Gradient Optimization Techniques
