SCONNA: A Stochastic Computing Based Optical Accelerator for Ultra-Fast, Energy-Efficient Inference of Integer-Quantized CNNs
Sairam Sri Vatsavai, Venkata Sai Praneeth Karempudi, Ishan Thakkar,, Ahmad Salehi, and Todd Hastings

TL;DR
SCONNA introduces a novel optical accelerator combining stochastic computing and microring resonator technology to significantly boost throughput and energy efficiency for high-precision CNN inference.
Contribution
It pioneers the integration of stochastic computing with MRR-based optical accelerators, enabling high-precision CNN inference with improved performance and flexibility.
Findings
Up to 66.5x increase in FPS over prior photonic accelerators.
Up to 90x improvement in FPS/W efficiency.
Minimal accuracy loss of up to 1.5% for CNNs.
Abstract
The acceleration of a CNN inference task uses convolution operations that are typically transformed into vector-dot-product (VDP) operations. Several photonic microring resonators (MRRs) based hardware architectures have been proposed to accelerate integer-quantized CNNs with remarkably higher throughput and energy efficiency compared to their electronic counterparts. However, the existing photonic MRR-based analog accelerators exhibit a very strong trade-off between the achievable input/weight precision and VDP operation size, which severely restricts their achievable VDP operation size for the quantized input/weight precision of 4 bits and higher. The restricted VDP operation size ultimately suppresses computing throughput to severely diminish the achievable performance benefits. To address this shortcoming, we for the first time present a merger of stochastic computing and MRR-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Reservoir Computing · Optical Network Technologies · Advanced Memory and Neural Computing
MethodsConvolution
