# EmBench: Quantifying Performance Variations of Deep Neural Networks   across Modern Commodity Devices

**Authors:** Mario Almeida, Stefanos Laskaridis, Ilias Leontiadis, Stylianos I., Venieris, Nicholas D. Lane

arXiv: 1905.07346 · 2019-05-20

## TL;DR

This paper systematically evaluates the performance of various state-of-the-art deep neural networks across different commodity devices to understand hardware compatibility and identify bottlenecks, aiding future co-design efforts.

## Contribution

It provides a comprehensive benchmarking framework and insights into DNN performance variations on modern hardware, guiding efficient neural network and accelerator design.

## Key findings

- Identification of hardware bottlenecks for different DNN architectures
- Performance variation analysis across multiple commodity devices
- Guidelines for co-design of DNNs and hardware accelerators

## Abstract

In recent years, advances in deep learning have resulted in unprecedented leaps in diverse tasks spanning from speech and object recognition to context awareness and health monitoring. As a result, an increasing number of AI-enabled applications are being developed targeting ubiquitous and mobile devices. While deep neural networks (DNNs) are getting bigger and more complex, they also impose a heavy computational and energy burden on the host devices, which has led to the integration of various specialized processors in commodity devices. Given the broad range of competing DNN architectures and the heterogeneity of the target hardware, there is an emerging need to understand the compatibility between DNN-platform pairs and the expected performance benefits on each platform. This work attempts to demystify this landscape by systematically evaluating a collection of state-of-the-art DNNs on a wide variety of commodity devices. In this respect, we identify potential bottlenecks in each architecture and provide important guidelines that can assist the community in the co-design of more efficient DNNs and accelerators.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1905.07346/full.md

## Figures

16 figures with captions in the complete paper: https://tomesphere.com/paper/1905.07346/full.md

## References

29 references — full list in the complete paper: https://tomesphere.com/paper/1905.07346/full.md

---
Source: https://tomesphere.com/paper/1905.07346