ParameterNet: Parameters Are All You Need

Kai Han; Yunhe Wang; Jianyuan Guo; Enhua Wu

arXiv:2306.14525·cs.CV·January 17, 2024·5 cites

ParameterNet: Parameters Are All You Need

Kai Han, Yunhe Wang, Jianyuan Guo, Enhua Wu

PDF

Open Access

TL;DR

ParameterNet introduces a method to increase model parameters using dynamic convolutions, enabling low-FLOPs models to benefit from large-scale pretraining in vision and language tasks, resulting in improved accuracy with minimal FLOP increase.

Contribution

The paper proposes ParameterNet, a novel design that augments parameters with minimal FLOP increase, enhancing pretraining benefits for low-FLOPs vision and language models.

Findings

01

ParameterNet achieves higher accuracy on ImageNet-22K with fewer FLOPs.

02

ParameterNet improves LLaMA-1B accuracy by 2% over vanilla LLaMA.

03

ParameterNet outperforms existing models in both vision and language domains.

Abstract

The large-scale visual pretraining has significantly improve the performance of large vision models. However, we observe the \emph{low FLOPs pitfall} that the existing low-FLOPs models cannot benefit from large-scale pretraining. In this paper, we introduce a novel design principle, termed ParameterNet, aimed at augmenting the number of parameters in large-scale visual pretraining models while minimizing the increase in FLOPs. We leverage dynamic convolutions to incorporate additional parameters into the networks with only a marginal rise in FLOPs. The ParameterNet approach allows low-FLOPs networks to take advantage of large-scale visual pretraining. Furthermore, we extend the ParameterNet concept to the language domain to enhance inference results while preserving inference speed. Experiments on the large-scale ImageNet-22K have shown the superiority of our ParameterNet scheme. For…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual Attention and Saliency Detection · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques

MethodsMulti-Head Attention · Attention Is All You Need · Absolute Position Encodings · Linear Layer · Layer Normalization · Position-Wise Feed-Forward Layer · Stochastic Depth · Dense Connections · Label Smoothing · Adam