FIXAR: A Fixed-Point Deep Reinforcement Learning Platform with   Quantization-Aware Training and Adaptive Parallelism

Je Yang; Seongmin Hong; Joo-Young Kim

arXiv:2102.12103·cs.AR·February 25, 2021·5 cites

FIXAR: A Fixed-Point Deep Reinforcement Learning Platform with Quantization-Aware Training and Adaptive Parallelism

Je Yang, Seongmin Hong, Joo-Young Kim

PDF

Open Access

TL;DR

FIXAR is a deep reinforcement learning platform that uses fixed-point arithmetic and adaptive parallelism, achieving high throughput and energy efficiency through quantization-aware training and hardware/software co-design.

Contribution

It introduces a fixed-point deep RL platform with quantization-aware training and configurable parallelism, enhancing efficiency and speed over traditional CPU-GPU systems.

Findings

01

Achieves 25293.3 inferences/sec training throughput

02

Attains 2638.0 inferences/sec/W efficiency

03

Outperforms CPU-GPU platforms by 2.7x speed and 15.4x energy efficiency

Abstract

In this paper, we present a deep reinforcement learning platform named FIXAR which employs fixed-point data types and arithmetic units for the first time using a SW/HW co-design approach. Starting from 32-bit fixed-point data, Quantization-Aware Training (QAT) reduces its data precision based on the range of activations and performs retraining to minimize the reward degradation. FIXAR proposes the adaptive array processing core composed of configurable processing elements to support both intra-layer parallelism and intra-batch parallelism for high-throughput inference and training. Finally, FIXAR was implemented on Xilinx U50 and achieves 25293.3 inferences per second (IPS) training throughput and 2638.0 IPS/W accelerator efficiency, which is 2.7 times faster and 15.4 times more energy efficient than those of the CPU-GPU platform without any accuracy degradation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Neural Network Applications · Advanced Memory and Neural Computing