An Empirical Investigation of Model-to-Model Distribution Shifts in   Trained Convolutional Filters

Paul Gavrikov; Janis Keuper

arXiv:2201.08465·cs.CV·January 24, 2022

An Empirical Investigation of Model-to-Model Distribution Shifts in Trained Convolutional Filters

Paul Gavrikov, Janis Keuper

PDF

Open Access 1 Repo

TL;DR

This paper empirically investigates distribution shifts in trained convolutional filters across various models and datasets, providing a large dataset and insights into how these shifts relate to model generalization and transfer learning.

Contribution

It introduces a large dataset of trained CNN filters and analyzes distribution shifts in these filters across different axes, offering new insights into model robustness and transfer learning.

Findings

01

Distribution shifts vary with data type, task, architecture, and layer depth.

02

Some filters show significant distribution shifts, others remain stable.

03

The properties of filter distributions can inform robustness and transfer learning strategies.

Abstract

We present first empirical results from our ongoing investigation of distribution shifts in image data used for various computer vision tasks. Instead of analyzing the original training and test data, we propose to study shifts in the learned weights of trained models. In this work, we focus on the properties of the distributions of dominantly used 3x3 convolution filter kernels. We collected and publicly provide a data set with over half a billion filters from hundreds of trained CNNs, using a wide range of data sets, architectures, and vision tasks. Our analysis shows interesting distribution shifts (or the lack thereof) between trained filters along different axes of meta-parameters, like data type, task, architecture, or layer depth. We argue, that the observed properties are a valuable source for further investigation into a better understanding of the impact of shifts in the input…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

paulgavrikov/cnn-filter-db
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning

MethodsConvolution