Structured Transforms for Small-Footprint Deep Learning

Vikas Sindhwani; Tara N. Sainath; Sanjiv Kumar

arXiv:1510.01722·stat.ML·October 7, 2015·103 cites

Structured Transforms for Small-Footprint Deep Learning

Vikas Sindhwani, Tara N. Sainath, Sanjiv Kumar

PDF

Open Access

TL;DR

This paper introduces a unified framework for structured transforms in deep learning that enables compact models suitable for mobile devices, significantly improving inference speed and model compression while maintaining high accuracy.

Contribution

A novel framework for learning structured parameter matrices with low displacement rank, balancing model complexity and efficiency for mobile deep learning applications.

Findings

01

Accelerates inference and training passes.

02

Achieves over 3.5-fold model compression in speech recognition.

03

Maintains near state-of-the-art accuracy with structured transforms.

Abstract

We consider the task of building compact deep learning pipelines suitable for deployment on storage and power constrained mobile devices. We propose a unified framework to learn a broad family of structured parameter matrices that are characterized by the notion of low displacement rank. Our structured transforms admit fast function and gradient evaluation, and span a rich range of parameter sharing configurations whose statistical modeling capacity can be explicitly tuned along a continuum from structured to unstructured. Experimental results show that these transforms can significantly accelerate inference and forward/backward passes during training, and offer superior accuracy-compactness-speed tradeoffs in comparison to a number of existing techniques. In keyword spotting applications in mobile speech recognition, our methods are much more effective than standard linear low-rank…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Speech Recognition and Synthesis · Music and Audio Processing