Tiny Deep Ensemble: Uncertainty Estimation in Edge AI Accelerators via   Ensembling Normalization Layers with Shared Weights

Soyed Tuhin Ahmed; Michael Hefenbrock; Mehdi B. Tahoori

arXiv:2405.05286·cs.LG·May 12, 2024

Tiny Deep Ensemble: Uncertainty Estimation in Edge AI Accelerators via Ensembling Normalization Layers with Shared Weights

Soyed Tuhin Ahmed, Michael Hefenbrock, Mehdi B. Tahoori

PDF

Open Access

TL;DR

This paper introduces Tiny-Deep Ensemble, a low-cost uncertainty estimation method for edge AI that ensem bles normalization layers with shared weights, significantly reducing hardware overhead while maintaining high accuracy.

Contribution

The paper proposes a novel ensemble approach that ensembles only normalization layers with shared weights, drastically reducing memory and latency overhead for uncertainty estimation on edge devices.

Findings

01

Reduces latency and memory overhead by up to ~M times.

02

Maintains or improves inference accuracy by up to ~1%.

03

Achieves a 17.17% reduction in RMSE across benchmarks.

Abstract

The applications of artificial intelligence (AI) are rapidly evolving, and they are also commonly used in safety-critical domains, such as autonomous driving and medical diagnosis, where functional safety is paramount. In AI-driven systems, uncertainty estimation allows the user to avoid overconfidence predictions and achieve functional safety. Therefore, the robustness and reliability of model predictions can be improved. However, conventional uncertainty estimation methods, such as the deep ensemble method, impose high computation and, accordingly, hardware (latency and energy) overhead because they require the storage and processing of multiple models. Alternatively, Monte Carlo dropout (MC-dropout) methods, although having low memory overhead, necessitate numerous ( $\sim 100$ ) forward passes, leading to high computational overhead and latency. Thus, these approaches are not suitable…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCCD and CMOS Imaging Sensors · Anomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning

MethodsDropout · Monte Carlo Dropout