# Cheetah: Mixed Low-Precision Hardware & Software Co-Design Framework for   DNNs on the Edge

**Authors:** Hamed F. Langroudi, Zachariah Carmichael, David Pastuch, Dhireesha, Kudithipudi

arXiv: 1908.02386 · 2019-08-08

## TL;DR

Cheetah is a co-design framework enabling low-precision DNN training and inference on edge devices using mixed numerical formats, notably posits, to improve efficiency and performance.

## Contribution

It introduces a versatile framework supporting posit-based DNN training and inference, with mixed-precision formats, for edge computing applications.

## Key findings

- 16-bit posits outperform 16-bit floating point in training.
- Inference with 5-8 bit posits improves performance-energy trade-offs.
- Framework supports various quantization approaches and formats.

## Abstract

Low-precision DNNs have been extensively explored in order to reduce the size of DNN models for edge devices. Recently, the posit numerical format has shown promise for DNN data representation and compute with ultra-low precision in [5..8]-bits. However, previous studies were limited to studying posit for DNN inference only. In this paper, we propose the Cheetah framework, which supports both DNN training and inference using posits, as well as other commonly used formats. Additionally, the framework is amenable for different quantization approaches and supports mixed-precision floating point and fixed-point numerical formats. Cheetah is evaluated on three datasets: MNIST, Fashion MNIST, and CIFAR-10. Results indicate that 16-bit posits outperform 16-bit floating point in DNN training. Furthermore, performing inference with [5..8]-bit posits improves the trade-off between performance and energy-delay-product over both [5..8]-bit float and fixed-point.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1908.02386/full.md

## Figures

23 figures with captions in the complete paper: https://tomesphere.com/paper/1908.02386/full.md

## References

45 references — full list in the complete paper: https://tomesphere.com/paper/1908.02386/full.md

---
Source: https://tomesphere.com/paper/1908.02386