Deep Networks with Adaptive Nystr\"om Approximation

Luc Giffon (QARMA; LIS); St\'ephane Ayache (QARMA; LIS); Thierry; Arti\`eres (QARMA; ECM; LIS); Hachem Kadri (QARMA; LIS)

arXiv:1911.13036·cs.LG·December 2, 2019

Deep Networks with Adaptive Nystr\"om Approximation

Luc Giffon (QARMA, LIS), St\'ephane Ayache (QARMA, LIS), Thierry, Arti\`eres (QARMA, ECM, LIS), Hachem Kadri (QARMA, LIS)

PDF

Open Access

TL;DR

This paper introduces a flexible neural network architecture that replaces dense layers with a Nyström kernel approximation, maintaining performance while reducing parameters, especially effective with small datasets.

Contribution

It presents a novel neural network design integrating Nyström kernel approximation, compatible with multiple kernels and suitable for small training datasets.

Findings

01

Achieves comparable performance to standard architectures on SVHN and CIFAR100.

02

Has fewer learnable parameters, beneficial for small datasets.

03

Compatible with any kernel function and multiple kernels.

Abstract

Recent work has focused on combining kernel methods and deep learning to exploit the best of the two approaches. Here, we introduce a new architecture of neural networks in which we replace the top dense layers of standard convolutional architectures with an approximation of a kernel function by relying on the Nystr{\"o}m approximation. Our approach is easy and highly flexible. It is compatible with any kernel function and it allows exploiting multiple kernels. We show that our architecture has the same performance than standard architecture on datasets like SVHN and CIFAR100. One benefit of the method lies in its limited number of learnable parameters which makes it particularly suited for small training set sizes, e.g. from 5 to 20 samples per class.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEnergy Load and Power Forecasting · Tensor decomposition and applications · Model Reduction and Neural Networks