Structured Transforms for Small-Footprint Deep Learning
Vikas Sindhwani, Tara N. Sainath, Sanjiv Kumar

TL;DR
This paper introduces a unified framework for structured transforms in deep learning that enables compact models suitable for mobile devices, significantly improving inference speed and model compression while maintaining high accuracy.
Contribution
A novel framework for learning structured parameter matrices with low displacement rank, balancing model complexity and efficiency for mobile deep learning applications.
Findings
Accelerates inference and training passes.
Achieves over 3.5-fold model compression in speech recognition.
Maintains near state-of-the-art accuracy with structured transforms.
Abstract
We consider the task of building compact deep learning pipelines suitable for deployment on storage and power constrained mobile devices. We propose a unified framework to learn a broad family of structured parameter matrices that are characterized by the notion of low displacement rank. Our structured transforms admit fast function and gradient evaluation, and span a rich range of parameter sharing configurations whose statistical modeling capacity can be explicitly tuned along a continuum from structured to unstructured. Experimental results show that these transforms can significantly accelerate inference and forward/backward passes during training, and offer superior accuracy-compactness-speed tradeoffs in comparison to a number of existing techniques. In keyword spotting applications in mobile speech recognition, our methods are much more effective than standard linear low-rank…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Speech Recognition and Synthesis · Music and Audio Processing
