Analyzing the Performance of Smart Industry 4.0 Applications on Cloud Computing Systems
Razin Farhan Hussain, Alireza Pakravan, Mohsen Amini Salehi

TL;DR
This paper analyzes the stochastic inference times of DNN applications in cloud environments, providing statistical models and confidence intervals to improve robustness and QoS in Industry 4.0 applications.
Contribution
It offers a dual perspective analysis—application-centric and resource-centric—of DNN inference time variability on cloud platforms using non-parametric statistical methods.
Findings
Inference times are highly stochastic due to multi-tenancy and resource heterogeneity.
Statistical models and confidence intervals for inference time and MIPS are developed.
Insights help improve robustness and QoS in cloud-based Industry 4.0 applications.
Abstract
Cloud-based Deep Neural Network (DNN) applications that make latency-sensitive inference are becoming an indispensable part of Industry 4.0. Due to the multi-tenancy and resource heterogeneity, both inherent to the cloud computing environments, the inference time of DNN-based applications are stochastic. Such stochasticity, if not captured, can potentially lead to low Quality of Service (QoS) or even a disaster in critical sectors, such as Oil and Gas industry. To make Industry 4.0 robust, solution architects and researchers need to understand the behavior of DNN-based applications and capture the stochasticity exists in their inference times. Accordingly, in this study, we provide a descriptive analysis of the inference time from two perspectives. First, we perform an application-centric analysis and statistically model the execution time of four categorically different DNN…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
