Reduced-Order Modeling of Deep Neural Networks

Julia Gusak; Talgat Daulbaev; Evgeny Ponomarev; Andrzej Cichocki; Ivan; Oseledets

arXiv:1910.06995·cs.LG·November 26, 2020

Reduced-Order Modeling of Deep Neural Networks

Julia Gusak, Talgat Daulbaev, Evgeny Ponomarev, Andrzej Cichocki, Ivan, Oseledets

PDF

TL;DR

This paper presents a novel reduced-order modeling approach inspired by dynamical systems to accelerate deep neural network inference by replacing convolutional layers with smaller fully-connected layers, maintaining accuracy.

Contribution

It introduces a maximum volume algorithm-based method for simplifying neural networks, demonstrating significant speedups with minimal accuracy loss.

Findings

01

Smaller fully-connected layers can replace convolutional layers effectively.

02

The method achieves speedups on various datasets.

03

Minimal accuracy drop observed in practical cases.

Abstract

We introduce a new method for speeding up the inference of deep neural networks. It is somewhat inspired by the reduced-order modeling techniques for dynamical systems.The cornerstone of the proposed method is the maximum volume algorithm. We demonstrate efficiency on neural networks pre-trained on different datasets. We show that in many practical cases it is possible to replace convolutional layers with much smaller fully-connected layers with a relatively small drop in accuracy.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.