Lap2: Revisiting Laplace DP-SGD for High Dimensions via Majorization Theory

Meisam Mohammady; Qin Yang; Nicholas Stout; Ayesha Samreen; Han Wang; Christopher J Quinn; Yuan Hong

arXiv:2602.23516·cs.CR·March 10, 2026

Lap2: Revisiting Laplace DP-SGD for High Dimensions via Majorization Theory

Meisam Mohammady, Qin Yang, Nicholas Stout, Ayesha Samreen, Han Wang, Christopher J Quinn, Yuan Hong

PDF

Open Access

TL;DR

This paper introduces Lap2, a novel method for L2 clipping in Laplace DP-SGD that overcomes high-dimensional limitations, enabling privacy-preserving training of large models with improved utility.

Contribution

Lap2 leverages majorization theory and coordinate-wise bounds to enable effective L2 clipping in high-dimensional Laplace DP-SGD, improving privacy utility trade-offs.

Findings

01

Achieves comparable or better accuracy than Gaussian DP-SGD under strong privacy constraints.

02

Enables privacy-preserving fine-tuning of large models like RoBERTa-base with high utility.

03

Scales gracefully with model dimension, handling thousands of moments.

Abstract

Differentially Private Stochastic Gradient Descent (DP-SGD) is a cornerstone technique for ensuring privacy in deep learning, widely used in both training from scratch and fine-tuning large-scale language models. While DP-SGD predominantly relies on the Gaussian mechanism, the Laplace mechanism remains underutilized due to its reliance on L1 norm clipping. This constraint severely limits its practicality in high-dimensional models because the L1 norm of an n-dimensional gradient can be up to sqrt(n) times larger than its L2 norm. As a result, the required noise scale grows significantly with model size, leading to poor utility or untrainable models. In this work, we introduce Lap2, a new solution that enables L2 clipping for Laplace DP-SGD while preserving strong privacy guarantees. We overcome the dimensionality-driven clipping barrier by computing coordinate-wise moment bounds and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Adversarial Robustness in Machine Learning