LRD-MPC: Efficient MPC Inference through Low-rank Decomposition
Tingting Tang, Yongqin Wang, Murali Annavaram

TL;DR
This paper introduces LRD-MPC, a method that applies low-rank decomposition to neural network layers to significantly improve the efficiency of secure MPC inference, reducing computation, communication, and energy costs.
Contribution
It proposes a novel application of low-rank decomposition in MPC to optimize neural network inference, along with techniques to mitigate associated overheads, achieving substantial speedups and energy savings.
Findings
Up to 25% speedup in 2-party MPC protocols
Up to 33% speedup in 3-party MPC protocols
52% reduction in GPU energy consumption
Abstract
Secure Multi-party Computation (MPC) enables untrusted parties to jointly compute a function without revealing their inputs. Its application to machine learning (ML) has gained significant attention, particularly for secure inference services deployed across multiple cloud virtual machines (VMs), where each VM acts as an MPC party. Model providers secret-share model weights, and users secret-share inputs, ensuring that each server operates only on random shares. While MPC provides strong cryptographic guarantees, it incurs substantial computational and communication overhead. Deep neural networks rely heavily on convolutional and fully connected layers, which require costly matrix multiplications in MPC. To reduce this cost, we propose leveraging low-rank decomposition (LRD) for linear layers, replacing one large matrix multiplication with two smaller ones. Each matrix multiplication in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Cryptography and Data Security · Cryptography and Residue Arithmetic
