Privacy-Aware Time Series Synthesis via Public Knowledge Distillation

Penghang Liu; Haibei Zhu; Eleonora Kreacic; Svitlana Vyetrenko

arXiv:2511.00700·cs.LG·November 4, 2025

Privacy-Aware Time Series Synthesis via Public Knowledge Distillation

Penghang Liu, Haibei Zhu, Eleonora Kreacic, Svitlana Vyetrenko

PDF

Open Access

TL;DR

This paper introduces Pub2Priv, a framework that uses public knowledge and self-attention to generate privacy-preserving synthetic time series data, improving privacy-utility balance in sensitive domains.

Contribution

The paper proposes a novel public knowledge distillation approach with a self-attention mechanism for better privacy-aware time series synthesis.

Findings

01

Outperforms state-of-the-art benchmarks in privacy-utility trade-offs

02

Effective across finance, energy, and commodity trading domains

03

Introduces a practical privacy assessment metric

Abstract

Sharing sensitive time series data in domains such as finance, healthcare, and energy consumption, such as patient records or investment accounts, is often restricted due to privacy concerns. Privacy-aware synthetic time series generation addresses this challenge by enforcing noise during training, inherently introducing a trade-off between privacy and utility. In many cases, sensitive sequences is correlated with publicly available, non-sensitive contextual metadata (e.g., household electricity consumption may be influenced by weather conditions and electricity prices). However, existing privacy-aware data generation methods often overlook this opportunity, resulting in suboptimal privacy-utility trade-offs. In this paper, we present Pub2Priv, a novel framework for generating private time series data by leveraging heterogeneous public knowledge. Our model employs a self-attention…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Machine Learning in Healthcare · Blockchain Technology Applications and Security