A Scalable Approach to Performing Multiplication and Matrix Dot-Products in Unary
Yadu Kiran, Marc Riedel

TL;DR
This paper introduces a scalable deterministic method for performing multiplication and matrix dot-products in unary, reducing latency and area compared to traditional stochastic computing, with applications in machine learning.
Contribution
It presents a novel approximation technique that maintains low latency and scales efficiently, improving upon existing stochastic computing methods for matrix operations.
Findings
Outperforms traditional stochastic designs in matrix multiplication
Area scales with the square root of the precision parameter
Latency is significantly reduced compared to previous deterministic approaches
Abstract
Stochastic computing is a paradigm in which logical operations are performed on randomly generated bit streams. Complex arithmetic operations can be executed by simple logic circuits, resulting in a much smaller area footprint compared to conventional binary counterparts. However, the random or pseudorandom sources required for generating the bit streams are costly in terms of area and offset the advantages. Additionally, due to the inherent randomness, the computation lacks precision, limiting the applicability of this paradigm. Importantly, achieving reasonable accuracy in stochastic computing involves high latency. Recently, deterministic approaches to stochastic computing have been proposed, demonstrating that randomness is not a requirement. By structuring the computation deterministically, exact results can be obtained, and the latency greatly reduced. The bit stream generated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsError Correcting Code Techniques · Stochastic Gradient Optimization Techniques · Quantum Computing Algorithms and Architecture
