Kernel Approximation using Analog In-Memory Computing

Julian B\"uchel; Giacomo Camposampiero; Athanasios Vasilopoulos; Corey; Lammie; Manuel Le Gallo; Abbas Rahimi; Abu Sebastian

arXiv:2411.03375·cs.LG·November 7, 2024

Kernel Approximation using Analog In-Memory Computing

Julian B\"uchel, Giacomo Camposampiero, Athanasios Vasilopoulos, Corey, Lammie, Manuel Le Gallo, Abbas Rahimi, Abu Sebastian

PDF

Open Access 1 Repo

TL;DR

This paper presents a novel kernel approximation method tailored for Analog In-Memory Computing architectures, significantly reducing computational costs while maintaining high accuracy in machine learning tasks.

Contribution

It introduces a hardware-compatible kernel approximation approach that executes in-memory operations, demonstrated on the IBM HERMES AIMC chip, improving efficiency and scalability.

Findings

01

Less than 1% accuracy drop in kernel ridge classification

02

Within 1% accuracy on kernelized attention in Transformers

03

Superior energy efficiency compared to digital accelerators

Abstract

Kernel functions are vital ingredients of several machine learning algorithms, but often incur significant memory and computational costs. We introduce an approach to kernel approximation in machine learning algorithms suitable for mixed-signal Analog In-Memory Computing (AIMC) architectures. Analog In-Memory Kernel Approximation addresses the performance bottlenecks of conventional kernel-based methods by executing most operations in approximate kernel methods directly in memory. The IBM HERMES Project Chip, a state-of-the-art phase-change memory based AIMC chip, is utilized for the hardware demonstration of kernel approximation. Experimental results show that our method maintains high accuracy, with less than a 1% drop in kernel-based ridge classification benchmarks and within 1% accuracy on the Long Range Arena benchmark for kernelized attention in Transformer neural networks.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

IBM/kernel-approximation-using-analog-in-memory-computing
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · Neural Networks and Applications · Neural Networks and Reservoir Computing

MethodsAttention Is All You Need · Linear Layer · Layer Normalization · Position-Wise Feed-Forward Layer · Adam · Multi-Head Attention · Residual Connection · Byte Pair Encoding · Dropout · Absolute Position Encodings