Knowledge Distillation Circumvents Nonlinearity for Optical Convolutional Neural Networks
Jinlin Xiang, Shane Colburn, Arka Majumdar, Eli Shlizerman

TL;DR
This paper introduces a spectral CNN architecture that uses knowledge distillation to replace nonlinearities with linear spectral operations, enabling faster optical implementations without sacrificing accuracy.
Contribution
It proposes a novel spectral CNN linear architecture trained via knowledge distillation to bypass nonlinearities, facilitating optical implementation and improved efficiency.
Findings
Spectral CNN with knowledge distillation surpasses linear CNN performance.
The linear spectral network approaches nonlinear CNN accuracy in image tasks.
Optical 4f system implementation benefits from the linear spectral approach.
Abstract
In recent years, Convolutional Neural Networks (CNNs) have enabled ubiquitous image processing applications. As such, CNNs require fast runtime (forward propagation) to process high-resolution visual streams in real time. This is still a challenging task even with state-of-the-art graphics and tensor processing units. The bottleneck in computational efficiency primarily occurs in the convolutional layers. Performing operations in the Fourier domain is a promising way to accelerate forward propagation since it transforms convolutions into elementwise multiplications, which are considerably faster to compute for large kernels. Furthermore, such computation could be implemented using an optical 4f system with orders of magnitude faster operation. However, a major challenge in using this spectral approach, as well as in an optical implementation of CNNs, is the inclusion of a nonlinearity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsKnowledge Distillation
