Accelerating Elliptic Curve Point Additions on Versal AI Engine for Multi-scalar Multiplication
Ayumi Ohno, Kotaro Shimamura, Shinya Takamaeda-Yamazaki

TL;DR
This paper presents hardware-aware optimizations for elliptic curve point additions to accelerate multi-scalar multiplication on Versal AI Engine, achieving significant speedup and efficient bandwidth utilization.
Contribution
It introduces novel algorithmic and spatial mapping optimizations tailored for Versal AI Engines to enhance elliptic curve computations in cryptography.
Findings
Achieved 568x speedup over CPU implementation.
Utilized 50.2% of theoretical memory bandwidth.
Developed carry propagation techniques exploiting VLIW and SIMD.
Abstract
Multi-scalar multiplication (MSM) is crucial in cryptographic applications and computationally intensive in zero-knowledge proofs. MSM involves accumulating the products of scalars and points on an elliptic curve over a 377-bit modulus, and the Pippenger algorithm converts MSM into a series of elliptic curve point additions (PADDs) with high parallelism. This study investigates accelerating MSM on the Versal ACAP platform, an emerging hardware that employs a spatial architecture integrating 400 AI Engines (AIEs) with programmable logic and a processing system. AIEs are SIMD-based VLIW processors capable of performing vector multiply-accumulate operations, making them well-suited for multiplication-heavy workloads in PADD. Unlike simpler multiplication tasks in previous studies, cryptographic computations also require complex operations such as carry propagation. These operations…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCryptography and Residue Arithmetic · Medical Imaging and Analysis · Advanced Numerical Analysis Techniques
