Learning Features with Parameter-Free Layers
Dongyoon Han, YoungJoon Yoo, Beomyoung Kim, Byeongho Heo

TL;DR
This paper proposes replacing trainable layers with parameter-free operations like max-pool in neural networks to improve efficiency without significantly sacrificing accuracy, supported by extensive experiments on ImageNet.
Contribution
It introduces a novel approach of heavily utilizing parameter-free operations in network design, challenging the traditional reliance on trainable layers for spatial processing.
Findings
Parameter-free operations can replace trainable layers without major accuracy loss.
Networks with parameter-free layers show improved speed, fewer parameters, and reduced FLOPs.
Experimental results on ImageNet validate the efficiency benefits of the proposed design.
Abstract
Trainable layers such as convolutional building blocks are the standard network design choices by learning parameters to capture the global context through successive spatial operations. When designing an efficient network, trainable layers such as the depthwise convolution is the source of efficiency in the number of parameters and FLOPs, but there was little improvement to the model speed in practice. This paper argues that simple built-in parameter-free operations can be a favorable alternative to the efficient trainable layers replacing spatial operations in a network architecture. We aim to break the stereotype of organizing the spatial operations of building blocks into trainable layers. Extensive experimental analyses based on layer-level studies with fully-trained models and neural architecture searches are provided to investigate whether parameter-free operations such as the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Advanced Neural Network Applications
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Depthwise Convolution · Convolution
