Low-Stack HAETAE for Memory-Constrained Microcontrollers
Gustavo Banegas (LIX, GRACE), Kim Youngbeom, Seo Seog Chung, Vredendaal Christine Van

TL;DR
This paper introduces a low-stack, memory-efficient implementation of the HAETAE signature scheme for microcontrollers, significantly reducing stack usage while maintaining performance.
Contribution
It proposes novel techniques like rejection-aware pass decomposition and reverse-order entropy coding to drastically lower stack requirements for HAETAE on microcontrollers.
Findings
Stack reduction of up to 95% across security levels.
Verification fits within 8 kB RAM and is significantly faster than previous implementations.
Performance impact is minimal, with some improvements in verification speed.
Abstract
We present a low-stack implementation of the module-lattice signature scheme HAETAE, targeting microcontrollers with 8 kB-16 kB of available SRAM. On such devices, peak stack usage is often the binding constraint, and HAETAE's hyperball-based sampler, large transient polynomial vectors, and variable-length signature payloads (hint and high-bits arrays) pose a particular challenge. To address this we introduce (i) Rejection-aware pass decomposition, which isolates encoding to the post-acceptance path; (ii) Component-level early rejection, which short-circuits the response computation when a partial norm already exceeds the bound; and (iii) Reverse-order streaming entropy coding using range Asymmetric Numeral Systems (rANS), which eliminates full hint and high-bits staging buffers. Combined with streamed matrix generation, a two-pass hyperball sampler with streaming Gaussian backend, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
