FIXAR: A Fixed-Point Deep Reinforcement Learning Platform with Quantization-Aware Training and Adaptive Parallelism
Je Yang, Seongmin Hong, Joo-Young Kim

TL;DR
FIXAR is a deep reinforcement learning platform that uses fixed-point arithmetic and adaptive parallelism, achieving high throughput and energy efficiency through quantization-aware training and hardware/software co-design.
Contribution
It introduces a fixed-point deep RL platform with quantization-aware training and configurable parallelism, enhancing efficiency and speed over traditional CPU-GPU systems.
Findings
Achieves 25293.3 inferences/sec training throughput
Attains 2638.0 inferences/sec/W efficiency
Outperforms CPU-GPU platforms by 2.7x speed and 15.4x energy efficiency
Abstract
In this paper, we present a deep reinforcement learning platform named FIXAR which employs fixed-point data types and arithmetic units for the first time using a SW/HW co-design approach. Starting from 32-bit fixed-point data, Quantization-Aware Training (QAT) reduces its data precision based on the range of activations and performs retraining to minimize the reward degradation. FIXAR proposes the adaptive array processing core composed of configurable processing elements to support both intra-layer parallelism and intra-batch parallelism for high-throughput inference and training. Finally, FIXAR was implemented on Xilinx U50 and achieves 25293.3 inferences per second (IPS) training throughput and 2638.0 IPS/W accelerator efficiency, which is 2.7 times faster and 15.4 times more energy efficient than those of the CPU-GPU platform without any accuracy degradation.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Advanced Neural Network Applications · Advanced Memory and Neural Computing
