A comprehensive study on ILP acceleration accounting for sparsity, area, energy, data movement using near-memory architecture
Siddhartha Raman Sundara Raman, Lizy K John, Jaydeep P. Kulkarni

TL;DR
This paper introduces SPARK, a near-memory ILP accelerator that significantly improves performance and energy efficiency for sparse and dense ILP and LP problems by leveraging sparsity detection and reuse-aware computation.
Contribution
SPARK reuses existing CPU cache for near-cache ILP acceleration with minimal hardware overhead, enabling substantial performance and energy efficiency gains over CPUs and GPUs.
Findings
Up to 15x performance improvement over CPUs for sparse ILPs
Up to 740x energy reduction compared to GPUs for sparse ILPs
Supports both sparse and dense ILPs and LPs with broad applicability.
Abstract
Integer Linear Programming (ILP) is widely used for solving real-world optimization problems, including network routing, map routing, and traffic scheduling. However, ILP algorithms are sparse and branch-intensive, making them inefficient on conventional CPUs and GPUs. Prior work has shown that large-scale ILP problems can require tens of hours of execution time even on massively parallel systems, limiting their applicability to time-sensitive decision-making workloads. Existing ILP solvers such as Gurobi employ software-level optimizations to handle sparsity on CPUs, but still face throughput limitations. GPU-based ILP solvers are also constrained because GPUs are not well suited for sparse and branch-heavy workloads, leading to thread divergence, under-utilization of streaming multiprocessors, and frequent host-device interactions. This paper presents SPARK, a sparsity-aware,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
