Inhibitor Transformers and Gated RNNs for Torus Efficient Fully Homomorphic Encryption
Rickard Br\"annvall, Tony Zhang, Henrik Forsgren, Andrei Stoian, Fredrik Sandin, Marcus Liwicki

TL;DR
This paper proposes inhibitor-based modifications to Transformers and Gated RNNs that replace costly multiplications with additive operations, enabling efficient encrypted inference under Fully Homomorphic Encryption while maintaining competitive accuracy.
Contribution
It introduces inhibitor designs that enable integer-only, low-depth neural network architectures suitable for FHE, significantly improving encrypted inference efficiency.
Findings
3-6 times speedup in encrypted inference
30-50% reduction in plaintext inference time
Maintains competitive accuracy on benchmark datasets
Abstract
This paper introduces efficient modifications to neural network-based sequence processing approaches, laying new grounds for scalable privacy-preserving machine learning under Fully Homomorphic Encryption (FHE). Transformers are now ubiquitous in AI applications and have largely supplanted Gated Recurrent Neural Networks (RNNs) as the standard architecture for sequence modeling. Both architectures rely on costly multiplications and complex activations that hinder encrypted inference. We focus on TFHE, which supports deep circuit evaluation and efficient univariate function evaluation but makes variable-to-variable multiplication particularly expensive. To address this, we propose inhibitor designs for Transformers and gated RNNs that replace multiplications and Softmax/Sigmoid activations with additive and ReLU-based operations. These changes enable integer-only computation, reduce…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCryptography and Residue Arithmetic · Brain Tumor Detection and Classification · Ferroelectric and Negative Capacitance Devices
MethodsSigmoid Activation · Gated Recurrent Unit · Tanh Activation · Long Short-Term Memory
