Leveraging Stochastic Depth Training for Adaptive Inference

Guilherme Korol; Antonio Carlos Schneider Beck; Jeronimo Castrillon

arXiv:2505.17626·cs.LG·May 26, 2025

Leveraging Stochastic Depth Training for Adaptive Inference

Guilherme Korol, Antonio Carlos Schneider Beck, Jeronimo Castrillon

PDF

TL;DR

This paper introduces a novel adaptive inference method leveraging stochastic depth training, enabling efficient, zero-overhead, and input-dependent model skipping with improved power efficiency and minimal accuracy loss.

Contribution

It proposes using stochastic depth-trained models to select optimal layer-skipping configurations for adaptive inference, simplifying existing techniques.

Findings

01

Up to 2X power efficiency improvement

02

Accuracy drops as low as 0.71%

03

Enhanced resilience to layer-skipping

Abstract

Dynamic DNN optimization techniques such as layer-skipping offer increased adaptability and efficiency gains but can lead to i) a larger memory footprint as in decision gates, ii) increased training complexity (e.g., with non-differentiable operations), and iii) less control over performance-quality trade-offs due to its inherent input-dependent execution. To approach these issues, we propose a simpler yet effective alternative for adaptive inference with a zero-overhead, single-model, and time-predictable inference. Central to our approach is the observation that models trained with Stochastic Depth -- a method for faster training of residual networks -- become more resilient to arbitrary layer-skipping at inference time. We propose a method to first select near Pareto-optimal skipping configurations from a stochastically-trained model to adapt the inference at runtime later. Compared…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsStochastic Depth