Distributed Evolution Strategies Using TPUs for Meta-Learning

Alex Sheng; Derek He

arXiv:2201.00093·cs.NE·January 4, 2022

Distributed Evolution Strategies Using TPUs for Meta-Learning

Alex Sheng, Derek He

PDF

TL;DR

This paper introduces a scalable distributed evolutionary meta-learning method using TPUs, achieving high accuracy on few-shot classification with significantly reduced memory usage compared to traditional backpropagation.

Contribution

It presents the first assessment of evolutionary meta-learning in supervised tasks and develops a general framework for distributed evolution strategies on TPUs.

Findings

01

Achieved 98.4% accuracy on 5-shot classification with Prototypical Networks.

02

Used up to 40 times less memory than automatic differentiation.

03

Observed classification accuracy up to 99.1% with larger populations.

Abstract

Meta-learning traditionally relies on backpropagation through entire tasks to iteratively improve a model's learning dynamics. However, this approach is computationally intractable when scaled to complex tasks. We propose a distributed evolutionary meta-learning strategy using Tensor Processing Units (TPUs) that is highly parallel and scalable to arbitrarily long tasks with no increase in memory cost. Using a Prototypical Network trained with evolution strategies on the Omniglot dataset, we achieved an accuracy of 98.4% on a 5-shot classification problem. Our algorithm used as much as 40 times less memory than automatic differentiation to compute the gradient, with the resulting model achieving accuracy within 1.3% of a backpropagation-trained equivalent (99.6%). We observed better classification accuracy as high as 99.1% with larger population configurations. We further experimentally…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.