Tiny Deep Ensemble: Uncertainty Estimation in Edge AI Accelerators via Ensembling Normalization Layers with Shared Weights
Soyed Tuhin Ahmed, Michael Hefenbrock, Mehdi B. Tahoori

TL;DR
This paper introduces Tiny-Deep Ensemble, a low-cost uncertainty estimation method for edge AI that ensem bles normalization layers with shared weights, significantly reducing hardware overhead while maintaining high accuracy.
Contribution
The paper proposes a novel ensemble approach that ensembles only normalization layers with shared weights, drastically reducing memory and latency overhead for uncertainty estimation on edge devices.
Findings
Reduces latency and memory overhead by up to ~M times.
Maintains or improves inference accuracy by up to ~1%.
Achieves a 17.17% reduction in RMSE across benchmarks.
Abstract
The applications of artificial intelligence (AI) are rapidly evolving, and they are also commonly used in safety-critical domains, such as autonomous driving and medical diagnosis, where functional safety is paramount. In AI-driven systems, uncertainty estimation allows the user to avoid overconfidence predictions and achieve functional safety. Therefore, the robustness and reliability of model predictions can be improved. However, conventional uncertainty estimation methods, such as the deep ensemble method, impose high computation and, accordingly, hardware (latency and energy) overhead because they require the storage and processing of multiple models. Alternatively, Monte Carlo dropout (MC-dropout) methods, although having low memory overhead, necessitate numerous () forward passes, leading to high computational overhead and latency. Thus, these approaches are not suitable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCCD and CMOS Imaging Sensors · Anomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning
MethodsDropout · Monte Carlo Dropout
