Utilizing Excess Resources in Training Neural Networks

Amit Henig; Raja Giryes

arXiv:2207.05532·cs.LG·July 13, 2022

Utilizing Excess Resources in Training Neural Networks

Amit Henig, Raja Giryes

PDF

Open Access 1 Repo

TL;DR

This paper introduces KFLO, a kernel filtering method that enhances neural network training by overparameterization without increasing inference complexity, applicable to various architectures and datasets.

Contribution

The paper proposes a novel kernel filtering linear overparameterization technique that improves training performance while maintaining low inference complexity.

Findings

01

KFLO improves test performance across multiple models.

02

The approach is compatible with various network architectures.

03

No additional inference cost is introduced.

Abstract

In this work, we suggest Kernel Filtering Linear Overparameterization (KFLO), where a linear cascade of filtering layers is used during training to improve network performance in test time. We implement this cascade in a kernel filtering fashion, which prevents the trained architecture from becoming unnecessarily deeper. This also allows using our approach with almost any network architecture and let combining the filtering layers into a single layer in test time. Thus, our approach does not add computational complexity during inference. We demonstrate the advantage of KFLO on various network models and datasets in supervised learning.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

amithenig/kflo
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Human Pose and Action Recognition · Gaussian Processes and Bayesian Inference

MethodsTest