Invocation-driven Neural Approximate Computing with a Multiclass-Classifier and Multiple Approximators
Haiyue Song, Chengwen Xu, Qiang Xu, Zhuoran Song, Naifeng Jing,, Xiaoyao Liang, Li Jiang

TL;DR
This paper introduces MCMA, a novel neural approximate computing architecture with multiple approximators and a multiclass classifier, significantly increasing invocation rates and energy efficiency by optimizing the mapping of inputs to approximators.
Contribution
The paper proposes a new MCMA architecture with shared hardware for multiple approximators and a multiclass classifier, enabling higher invocation rates and energy savings.
Findings
Increased invocation rate of approximators.
Enhanced energy efficiency in neural approximate computing.
Effective co-training methods for MCMA architecture.
Abstract
Neural approximate computing gains enormous energy-efficiency at the cost of tolerable quality-loss. A neural approximator can map the input data to output while a classifier determines whether the input data are safe to approximate with quality guarantee. However, existing works cannot maximize the invocation of the approximator, resulting in limited speedup and energy saving. By exploring the mapping space of those target functions, in this paper, we observe a nonuniform distribution of the approximation error incurred by the same approximator. We thus propose a novel approximate computing architecture with a Multiclass-Classifier and Multiple Approximators (MCMA). These approximators have identical network topologies and thus can share the same hardware resource in a neural processing unit(NPU) clip. In the runtime, MCMA can swap in the invoked approximator by merely shipping the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Neural Networks and Reservoir Computing
