X-CNN: Cross-modal Convolutional Neural Networks for Sparse Datasets

Petar Veli\v{c}kovi\'c; Duo Wang; Nicholas D. Lane; Pietro Li\`o

arXiv:1610.00163·stat.ML·September 26, 2017

X-CNN: Cross-modal Convolutional Neural Networks for Sparse Datasets

Petar Veli\v{c}kovi\'c, Duo Wang, Nicholas D. Lane, Pietro Li\`o

PDF

TL;DR

This paper introduces X-CNNs, a biologically inspired neural network architecture that enhances learning from sparse datasets by enabling information exchange between specialized subnetworks, outperforming traditional CNNs in low-data scenarios.

Contribution

The paper proposes a novel cross-modal CNN architecture that allows for unconstrained information flow between subnetworks, improving performance on sparse datasets.

Findings

01

X-CNNs outperform standard CNNs on CIFAR-10 and CIFAR-100 with limited training data.

02

Cross-modal architecture provides a 2-6% accuracy improvement in low-data environments.

03

X-CNNs maintain competitive performance even with full datasets.

Abstract

In this paper we propose cross-modal convolutional neural networks (X-CNNs), a novel biologically inspired type of CNN architectures, treating gradient descent-specialised CNNs as individual units of processing in a larger-scale network topology, while allowing for unconstrained information flow and/or weight sharing between analogous hidden layers of the network---thus generalising the already well-established concept of neural network ensembles (where information typically may flow only between the output layers of the individual networks). The constituent networks are individually designed to learn the output function on their own subset of the input data, after which cross-connections between them are introduced after each pooling operation to periodically allow for information exchange between them. This injection of knowledge into a model (by prior partition of the input data…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.