# autoAx: An Automatic Design Space Exploration and Circuit Building   Methodology utilizing Libraries of Approximate Components

**Authors:** Vojtech Mrazek, Muhammad Abdullah Hanif, Zdenek Vasicek and, Lukas Sekanina, Muhammad Shafique

arXiv: 1902.10807 · 2019-06-13

## TL;DR

autoAx is a novel methodology that leverages machine learning to efficiently explore and combine approximate circuit libraries, enabling rapid design of energy-efficient accelerators with optimal tradeoffs.

## Contribution

The paper introduces a machine learning-based approach for automatic design space exploration and circuit combination in approximate computing, significantly reducing search time.

## Key findings

- Constructed hundreds of approximate accelerators for a Sobel edge detector.
- Identified Pareto-optimal tradeoffs between quality and hardware cost.
- Found approximately 10^3 important implementations out of 10^23 possibilities in hours.

## Abstract

Approximate computing is an emerging paradigm for developing highly energy-efficient computing systems such as various accelerators. In the literature, many libraries of elementary approximate circuits have already been proposed to simplify the design process of approximate accelerators. Because these libraries contain from tens to thousands of approximate implementations for a single arithmetic operation it is intractable to find an optimal combination of approximate circuits in the library even for an application consisting of a few operations. An open problem is "how to effectively combine circuits from these libraries to construct complex approximate accelerators". This paper proposes a novel methodology for searching, selecting and combining the most suitable approximate circuits from a set of available libraries to generate an approximate accelerator for a given application. To enable fast design space generation and exploration, the methodology utilizes machine learning techniques to create computational models estimating the overall quality of processing and hardware cost without performing full synthesis at the accelerator level. Using the methodology, we construct hundreds of approximate accelerators (for a Sobel edge detector) showing different but relevant tradeoffs between the quality of processing and hardware cost and identify a corresponding Pareto-frontier. Furthermore, when searching for approximate implementations of a generic Gaussian filter consisting of 17 arithmetic operations, the proposed approach allows us to identify approximately $10^3$ highly important implementations from $10^{23}$ possible solutions in a few hours, while the exhaustive search would take four months on a high-end processor.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1902.10807/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/1902.10807/full.md

## References

17 references — full list in the complete paper: https://tomesphere.com/paper/1902.10807/full.md

---
Source: https://tomesphere.com/paper/1902.10807