Kernel Approximation using Analog In-Memory Computing
Julian B\"uchel, Giacomo Camposampiero, Athanasios Vasilopoulos, Corey, Lammie, Manuel Le Gallo, Abbas Rahimi, Abu Sebastian

TL;DR
This paper presents a novel kernel approximation method tailored for Analog In-Memory Computing architectures, significantly reducing computational costs while maintaining high accuracy in machine learning tasks.
Contribution
It introduces a hardware-compatible kernel approximation approach that executes in-memory operations, demonstrated on the IBM HERMES AIMC chip, improving efficiency and scalability.
Findings
Less than 1% accuracy drop in kernel ridge classification
Within 1% accuracy on kernelized attention in Transformers
Superior energy efficiency compared to digital accelerators
Abstract
Kernel functions are vital ingredients of several machine learning algorithms, but often incur significant memory and computational costs. We introduce an approach to kernel approximation in machine learning algorithms suitable for mixed-signal Analog In-Memory Computing (AIMC) architectures. Analog In-Memory Kernel Approximation addresses the performance bottlenecks of conventional kernel-based methods by executing most operations in approximate kernel methods directly in memory. The IBM HERMES Project Chip, a state-of-the-art phase-change memory based AIMC chip, is utilized for the hardware demonstration of kernel approximation. Experimental results show that our method maintains high accuracy, with less than a 1% drop in kernel-based ridge classification benchmarks and within 1% accuracy on the Long Range Arena benchmark for kernelized attention in Transformer neural networks.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Neural Networks and Applications · Neural Networks and Reservoir Computing
MethodsAttention Is All You Need · Linear Layer · Layer Normalization · Position-Wise Feed-Forward Layer · Adam · Multi-Head Attention · Residual Connection · Byte Pair Encoding · Dropout · Absolute Position Encodings
