LIMO: Low-Power In-Memory-Annealer and Matrix-Multiplication Primitive for Edge Computing
Amod Holla, Sumedh Chatterjee, Sutanu Sen, Anushka Mukherjee, Fernando Garcia-Redondo, Dwaipayan Biswas, Francesca Iacopi, Kaushik Roy

TL;DR
LIMO introduces a low-power in-memory annealer and matrix multiplication primitive that enhances solution quality and efficiency for large-scale combinatorial optimization problems like TSP, with applications in neural network inference.
Contribution
The paper presents a novel mixed-signal macro, LIMO, that implements an in-memory annealing algorithm with reduced search space complexity and supports parallel refinement, improving performance on large TSP instances.
Findings
Achieves superior solution quality and faster solutions for TSP up to 85,900 cities.
Supports neural network inference with comparable accuracy to software, but with lower latency and energy.
Modular design allows reuse for vector-matrix multiplications in various applications.
Abstract
Combinatorial optimization (CO) underpins applications in science and engineering, ranging from logistics to electronic design automation. A classic example is the NP-complete Traveling Salesman Problem (TSP). Finding exact solutions for large-scale TSP instances remains computationally intractable; on von Neumann architectures, such solvers are constrained by the memory wall, incurring compute-memory traffic that grows with instance size. Metaheuristics, such as simulated annealing implemented on compute-in-memory (CiM) architectures, offer a way to mitigate the von Neumann bottleneck. This is accomplished by performing in-memory optimization cycles to rapidly find approximate solutions for TSP instances. Yet this approach suffers from degrading solution quality as instance size increases, owing to inefficient state-space exploration. To address this, we present LIMO, a mixed-signal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGraph Theory and Algorithms · Parallel Computing and Optimization Techniques · Ferroelectric and Negative Capacitance Devices
