DP-aware AdaLN-Zero: Taming Conditioning-Induced Heavy-Tailed Gradients in Differentially Private Diffusion
Tao Huang, Jiayang Meng, Xu Yang, Chen Hou, Hong Chen

TL;DR
This paper introduces DP-aware AdaLN-Zero, a sensitivity-aware conditioning mechanism for differentially private diffusion models that reduces heavy-tailed gradients and improves utility in time-series tasks.
Contribution
It proposes a novel conditioning method that limits gradient tail events without altering DP-SGD, enhancing private diffusion model performance.
Findings
Improves interpolation, imputation, and forecasting under privacy constraints.
Reduces gradient tail events and clipping bias in private training.
Achieves better utility on real-world and benchmark datasets.
Abstract
Condition injection enables diffusion models to generate context-aware outputs, which is essential for many time-series tasks. However, heterogeneous conditional contexts (e.g., observed history, missingness patterns or outlier covariates) can induce heavy-tailed per-example gradients. Under Differentially Private Stochastic Gradient Descent (DP-SGD), these rare conditioning-driven heavy-tailed gradients disproportionately trigger global clipping, resulting in outlier-dominated updates, larger clipping bias, and degraded utility under a fixed privacy budget. In this paper, we propose DP-aware AdaLN-Zero, a drop-in sensitivity-aware conditioning mechanism for conditional diffusion transformers that limits conditioning-induced gain without modifying the DP-SGD mechanism. DP-aware AdaLN-Zero jointly constrains conditioning representation magnitude and AdaLN modulation parameters via…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Generative Adversarial Networks and Image Synthesis · Advanced Neuroimaging Techniques and Applications
