Dual Precision Deep Neural Network

Jae Hyun Park; Ji Sub Choi; Jong Hwan Ko

arXiv:2009.02191·cs.LG·May 14, 2024

Dual Precision Deep Neural Network

Jae Hyun Park, Ji Sub Choi, Jong Hwan Ko

PDF

1 Repo

TL;DR

This paper introduces a dual-precision deep neural network that enables on-line switching between precision modes without re-training, balancing accuracy and complexity during inference.

Contribution

It proposes a novel dual-precision DNN architecture with a two-phase training process for simultaneous optimization of both precision modes.

Findings

01

Supports on-line precision switching without re-training

02

Optimizes both low- and high-precision modes effectively

03

Enhances inference flexibility and efficiency

Abstract

On-line Precision scalability of the deep neural networks(DNNs) is a critical feature to support accuracy and complexity trade-off during the DNN inference. In this paper, we propose dual-precision DNN that includes two different precision modes in a single model, thereby supporting an on-line precision switch without re-training. The proposed two-phase training process optimizes both low- and high-precision modes.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ParkJHyun/DualPrecisionDeepNeuralNetworks
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.