Learning Features with Parameter-Free Layers

Dongyoon Han; YoungJoon Yoo; Beomyoung Kim; Byeongho Heo

arXiv:2202.02777·cs.CV·March 22, 2022·5 cites

Learning Features with Parameter-Free Layers

Dongyoon Han, YoungJoon Yoo, Beomyoung Kim, Byeongho Heo

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper proposes replacing trainable layers with parameter-free operations like max-pool in neural networks to improve efficiency without significantly sacrificing accuracy, supported by extensive experiments on ImageNet.

Contribution

It introduces a novel approach of heavily utilizing parameter-free operations in network design, challenging the traditional reliance on trainable layers for spatial processing.

Findings

01

Parameter-free operations can replace trainable layers without major accuracy loss.

02

Networks with parameter-free layers show improved speed, fewer parameters, and reduced FLOPs.

03

Experimental results on ImageNet validate the efficiency benefits of the proposed design.

Abstract

Trainable layers such as convolutional building blocks are the standard network design choices by learning parameters to capture the global context through successive spatial operations. When designing an efficient network, trainable layers such as the depthwise convolution is the source of efficiency in the number of parameters and FLOPs, but there was little improvement to the model speed in practice. This paper argues that simple built-in parameter-free operations can be a favorable alternative to the efficient trainable layers replacing spatial operations in a network architecture. We aim to break the stereotype of organizing the spatial operations of building blocks into trainable layers. Extensive experimental analyses based on layer-level studies with fully-trained models and neural architecture searches are provided to investigate whether parameter-free operations such as the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

naver-ai/pflayer
pytorchOfficial

Videos

Learning Features with Parameter-Free Layers· slideslive

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Advanced Neural Network Applications

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Depthwise Convolution · Convolution