Structural Dropout for Model Width Compression
Julian Knodt

TL;DR
This paper introduces Structural Dropout, a method that learns feature importance in neural networks during a single training session, enabling efficient model compression at inference without retraining.
Contribution
It proposes a novel structural dropout technique that orders features by importance, allowing flexible inference-time pruning for model size reduction without additional training.
Findings
Enables significant parameter reduction with minimal accuracy loss.
Requires only one training session for original and compressed models.
Allows non-experts to trade off memory and accuracy at inference.
Abstract
Existing ML models are known to be highly over-parametrized, and use significantly more resources than required for a given task. Prior work has explored compressing models offline, such as by distilling knowledge from larger models into much smaller ones. This is effective for compression, but does not give an empirical method for measuring how much the model can be compressed, and requires additional training for each compressed model. We propose a method that requires only a single training session for the original model and a set of compressed models. The proposed approach is a "structural" dropout that prunes all elements in the hidden state above a randomly chosen index, forcing the model to learn an importance ordering over its features. After learning this ordering, at inference time unimportant features can be pruned while retaining most accuracy, reducing parameter size…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Machine Learning and Data Classification · Advanced Data Storage Technologies
MethodsDropout
