Neural Prompt Search

Yuanhan Zhang; Kaiyang Zhou; Ziwei Liu

arXiv:2206.04673·cs.CV·June 15, 2022

Neural Prompt Search

Yuanhan Zhang, Kaiyang Zhou, Ziwei Liu

PDF

1 Repo

TL;DR

This paper introduces NOAH, a neural architecture search method for designing optimal prompt modules in vision models, improving parameter-efficient tuning across diverse datasets with strong few-shot and domain generalization capabilities.

Contribution

It proposes a novel neural architecture search approach to automatically design prompt modules for vision models, reducing manual tuning efforts.

Findings

01

NOAH outperforms individual prompt modules on multiple datasets.

02

It demonstrates strong few-shot learning ability.

03

It generalizes well across different domains.

Abstract

The size of vision models has grown exponentially over the last few years, especially after the emergence of Vision Transformer. This has motivated the development of parameter-efficient tuning methods, such as learning adapter layers or visual prompt tokens, which allow a tiny portion of model parameters to be trained whereas the vast majority obtained from pre-training are frozen. However, designing a proper tuning method is non-trivial: one might need to try out a lengthy list of design choices, not to mention that each downstream dataset often requires custom designs. In this paper, we view the existing parameter-efficient tuning methods as "prompt modules" and propose Neural prOmpt seArcH (NOAH), a novel approach that learns, for large vision models, the optimal design of prompt modules through a neural architecture search algorithm, specifically for each downstream dataset. By…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ZhangYuanhan-AI/NOAH
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Label Smoothing · Softmax · Byte Pair Encoding · Adam · Dropout · Residual Connection · Dense Connections