QDOT: Quantized Dot Product Kernel for Approximate High-Performance Computing
James Diffenderfer, Daniel Osei-Kuffuor, Harshitha Menon

TL;DR
This paper introduces qdot, an error-bounded approximate dot product kernel, enabling high-performance computing applications to reduce computation costs while maintaining guaranteed error bounds, demonstrated on scientific benchmarks.
Contribution
The paper proposes a general framework for designing error-bounded approximate computing strategies and applies it to create qdot, a novel approximate dot product kernel with theoretical error guarantees.
Findings
qdot provides a deterministic bound on approximation error.
qdot achieves significant quantization without affecting convergence.
Effective on scientific benchmarks like CG and Power method.
Abstract
Approximate computing techniques have been successful in reducing computation and power costs in several domains. However, error sensitive applications in high-performance computing are unable to benefit from existing approximate computing strategies that are not developed with guaranteed error bounds. While approximate computing techniques can be developed for individual high-performance computing applications by domain specialists, this often requires additional theoretical analysis and potentially extensive software modification. Hence, the development of low-level error-bounded approximate computing strategies that can be introduced into any high-performance computing application without requiring additional analysis or significant software alterations is desirable. In this paper, we provide a contribution in this direction by proposing a general framework for designing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLow-power high-performance VLSI design · Parallel Computing and Optimization Techniques · VLSI and FPGA Design Techniques
