# Deep Anchored Convolutional Neural Networks

**Authors:** Jiahui Huang, Kshitij Dwivedi, Gemma Roig

arXiv: 1904.09764 · 2019-04-23

## TL;DR

This paper introduces Deep Anchored Convolutional Neural Networks (DACNNs), which share weights across layers to drastically reduce memory usage while maintaining high accuracy on vision tasks.

## Contribution

The paper proposes a novel CNN architecture with shared kernel weights across layers, enabling deep networks with minimal memory increase and introducing modules to enhance performance.

## Key findings

- Memory is reduced by a factor of L in deep DACNNs.
- High accuracy is maintained despite significant model compression.
- Validated on CIFAR-10, CIFAR-100, and SVHN datasets.

## Abstract

Convolutional Neural Networks (CNNs) have been proven to be extremely successful at solving computer vision tasks. State-of-the-art methods favor such deep network architectures for its accuracy performance, with the cost of having massive number of parameters and high weights redundancy. Previous works have studied how to prune such CNNs weights. In this paper, we go to another extreme and analyze the performance of a network stacked with a single convolution kernel across layers, as well as other weights sharing techniques. We name it Deep Anchored Convolutional Neural Network (DACNN). Sharing the same kernel weights across layers allows to reduce the model size tremendously, more precisely, the network is compressed in memory by a factor of L, where L is the desired depth of the network, disregarding the fully connected layer for prediction. The number of parameters in DACNN barely increases as the network grows deeper, which allows us to build deep DACNNs without any concern about memory costs. We also introduce a partial shared weights network (DACNN-mix) as well as an easy-plug-in module, coined regulators, to boost the performance of our architecture. We validated our idea on 3 datasets: CIFAR-10, CIFAR-100 and SVHN. Our results show that we can save massive amounts of memory with our model, while maintaining a high accuracy performance.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.09764/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/1904.09764/full.md

## References

33 references — full list in the complete paper: https://tomesphere.com/paper/1904.09764/full.md

---
Source: https://tomesphere.com/paper/1904.09764