Learning Structure and Strength of CNN Filters for Small Sample Size   Training

Rohit Keshari; Mayank Vatsa; Richa Singh; Afzel Noore

arXiv:1803.11405·cs.CV·April 2, 2018

Learning Structure and Strength of CNN Filters for Small Sample Size Training

Rohit Keshari, Mayank Vatsa, Richa Singh, Afzel Noore

PDF

TL;DR

This paper introduces SSF-CNN, a novel approach that learns filter structure and strength to enable effective training of CNNs with small datasets, achieving high accuracy and reducing parameters.

Contribution

The paper proposes a new CNN training method that initializes filter structure via dictionary learning and learns filter strength from limited data, improving small sample performance.

Findings

01

SSF-CNN reduces parameter count significantly.

02

Achieves high accuracy on small datasets like newborn face recognition.

03

Outperforms existing methods with at least 10% accuracy improvement.

Abstract

Convolutional Neural Networks have provided state-of-the-art results in several computer vision problems. However, due to a large number of parameters in CNNs, they require a large number of training samples which is a limiting factor for small sample size problems. To address this limitation, we propose SSF-CNN which focuses on learning the structure and strength of filters. The structure of the filter is initialized using a dictionary-based filter learning algorithm and the strength of the filter is learned using the small sample training data. The architecture provides the flexibility of training with both small and large training databases and yields good accuracies even with small size training data. The effectiveness of the algorithm is first demonstrated on MNIST, CIFAR10, and NORB databases, with a varying number of training samples. The results show that SSF-CNN significantly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.