Fisher Task Distance and Its Application in Neural Architecture Search

Cat P. Le; Mohammadreza Soltani; Juncheng Dong; Vahid Tarokh

arXiv:2103.12827·cs.LG·May 3, 2022

Fisher Task Distance and Its Application in Neural Architecture Search

Cat P. Le, Mohammadreza Soltani, Juncheng Dong, Vahid Tarokh

PDF

1 Repo

TL;DR

This paper introduces Fisher task distance, a novel measure based on Fisher Information Matrices, to quantify task similarity and improve neural architecture search by leveraging related learned tasks, leading to reduced search space and better performance.

Contribution

The paper proposes a new Fisher task distance metric, proves its consistency, and applies it to enhance neural architecture search by utilizing task similarities to reduce search complexity.

Findings

01

Fisher task distance effectively measures task transfer complexity.

02

Using task similarity reduces neural architecture search space.

03

The approach outperforms existing gradient-based search methods.

Abstract

We formulate an asymmetric (or non-commutative) distance between tasks based on Fisher Information Matrices, called Fisher task distance. This distance represents the complexity of transferring the knowledge from one task to another. We provide a proof of consistency for our distance through theorems and experiments on various classification tasks from MNIST, CIFAR-10, CIFAR-100, ImageNet, and Taskonomy datasets. Next, we construct an online neural architecture search framework using the Fisher task distance, in which we have access to the past learned tasks. By using the Fisher task distance, we can identify the closest learned tasks to the target task, and utilize the knowledge learned from these related tasks for the target task. Here, we show how the proposed distance between a target task and a set of learned tasks can be used to reduce the neural architecture search space for the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lephuoccat/Fisher-Information-NAS
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.