SPDZCoder: Combining Expert Knowledge with LLMs for Generating Privacy-Computing Code
Xiaoning Dong, Peilin Xin, Jia Li, Wei Xu

TL;DR
SPDZCoder is a rule-based framework that enhances large language models with expert knowledge to automatically generate privacy-preserving code, significantly improving translation accuracy from Python to MP-SPDZ without additional training data.
Contribution
It introduces a novel expert knowledge collection process and a three-stage translation pipeline for Python to MP-SPDZ code conversion, addressing data scarcity and evaluation challenges.
Findings
Achieves 85.94% pass@1 correctness, outperforming baselines.
Uses a high-quality expert knowledge base for translation.
Constructed a benchmark dataset for evaluation.
Abstract
Privacy computing receives increasing attention but writing privacy computing code remains challenging for developers due to limited library functions, necessitating function implementation from scratch, and data-oblivious requirement, contradicting intuitive thinking and usual practices of programmers. Automating the generation of privacy computing code with Large Language Models can streamline development effort and lower the barrier to using privacy computing frameworks. However, existing LLMs still encounter challenges in code translation for privacy-preserving computation, such as translating Python to MP-SPDZ, due to the scarcity of MP-SPDZ data required for effective pre-training or fine-tuning. Moreover, the lack of a benchmark further complicates the evaluation of translation quality. To address the limitations, this work proposes SPDZCoder, a rule-based framework that combines…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Digital and Cyber Forensics
