TL;DR
This paper presents optimized algorithms for polynomial multiplication over binary rings using AVX512 and VPCLMULQDQ instructions, achieving significant speedups in cryptographic protocols like HQC.
Contribution
It introduces new vectorized multiplication kernels leveraging AVX512 and VPCLMULQDQ, improving efficiency over previous AVX2-based methods in post-quantum cryptography.
Findings
Up to 39% reduction in processor cycles for polynomial multiplication.
Up to 12% speedup in key pair generation in HQC protocol.
Different kernels optimized for various polynomial degrees and instruction sets.
Abstract
Code-based cryptography is one of the main propositions for the post-quantum cryptographic context, and several protocols of this kind have been submitted on the NIST platform. Among them, BIKE and HQC are part of the five alternate candidates selected in the third round of the NIST standardization process in the KEM category. These two schemes make use of multiplication of large polynomials over binary rings, and due to the polynomial size (from 10,000 to 60,000 bits), this operation is one of the costliest during key generation, encapsulation, or decapsulation mechanisms. In this work, we revisit the different existing constant-time algorithms for arbitrary polynomial multiplication. We explore the different Karatsuba and Toom-Cook constructions in order to determine the best combinations for each polynomial degree range, in the context of AVX2 and AVX512 instruction sets. This leads…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
