BEEM: Boosting Performance of Early Exit DNNs using Multi-Exit Classifiers as Experts
Divya Jyoti Bajpai, Manjesh Kumar Hanawal

TL;DR
BEEM introduces a novel multi-exit classifier ensemble approach for early exit DNNs, improving inference speed while maintaining or enhancing accuracy across vision and language tasks.
Contribution
It proposes a new confidence aggregation criterion for early exit decisions, leveraging ensemble effects of expert classifiers to boost performance.
Findings
Achieves 1.5x to 2.1x speed-up over state-of-the-art EE methods.
Maintains comparable accuracy in challenging tasks and improves in easier tasks.
Demonstrates effectiveness on COCO image captioning and GLUE language benchmarks.
Abstract
Early Exit (EE) techniques have emerged as a means to reduce inference latency in Deep Neural Networks (DNNs). The latency improvement and accuracy in these techniques crucially depend on the criteria used to make exit decisions. We propose a new decision criterion where exit classifiers are treated as experts BEEM and aggregate their confidence scores. The confidence scores are aggregated only if neighbouring experts are consistent in prediction as the samples pass through them, thus capturing their ensemble effect. A sample exits when the aggregated confidence value exceeds a threshold. The threshold is set using the error rates of the intermediate exits aiming to surpass the performance of conventional DNN inference. Experimental results on the COCO dataset for Image captioning and GLUE datasets for various language tasks demonstrate that our method enhances the performance of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis
