# Multi-Precision Quantized Neural Networks via Encoding Decomposition of   -1 and +1

**Authors:** Qigong Sun, Fanhua Shang, Kang Yang, Xiufang Li, Yan Ren, and Licheng Jiao

arXiv: 1905.13389 · 2019-06-03

## TL;DR

This paper introduces a novel encoding scheme for quantized neural networks using {-1,+1} to create multi-branch binary networks, enabling efficient implementation on hardware like FPGA and ASIC, with minimal performance loss.

## Contribution

The paper proposes a new encoding decomposition method for QNNs into multi-branch binary networks, facilitating flexible precision and hardware-efficient deployment.

## Key findings

- Achieves model compression and acceleration via bitwise operations.
- Maintains near full-precision accuracy on ImageNet and object detection tasks.
- Supports arbitrary precision encoding tailored to hardware constraints.

## Abstract

The training of deep neural networks (DNNs) requires intensive resources both for computation and for storage performance. Thus, DNNs cannot be efficiently applied to mobile phones and embedded devices, which seriously limits their applicability in industry applications. To address this issue, we propose a novel encoding scheme of using {-1,+1} to decompose quantized neural networks (QNNs) into multi-branch binary networks, which can be efficiently implemented by bitwise operations (xnor and bitcount) to achieve model compression, computational acceleration and resource saving. Based on our method, users can easily achieve different encoding precisions arbitrarily according to their requirements and hardware resources. The proposed mechanism is very suitable for the use of FPGA and ASIC in terms of data storage and computation, which provides a feasible idea for smart chips. We validate the effectiveness of our method on both large-scale image classification tasks (e.g., ImageNet) and object detection tasks. In particular, our method with low-bit encoding can still achieve almost the same performance as its full-precision counterparts.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1905.13389/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/1905.13389/full.md

## References

41 references — full list in the complete paper: https://tomesphere.com/paper/1905.13389/full.md

---
Source: https://tomesphere.com/paper/1905.13389