Uncovering Hidden Subspaces in Video Diffusion Models Using Re-Identification
Mischa Dombrowski, Hadrien Reynaud, Bernhard Kainz

TL;DR
This paper investigates the limitations of latent video diffusion models in healthcare, revealing they only learn a small fraction of training data and proposing methods to evaluate and improve their fidelity and privacy.
Contribution
It introduces a novel approach using re-identification models trained in latent space to assess the subspace coverage of synthetic videos and improve downstream task performance.
Findings
Only up to 30.8% of training videos are learned in latent models
Latent space training is more efficient and generalizes better
Proposed methods help evaluate and enhance synthetic video fidelity
Abstract
Latent Video Diffusion Models can easily deceive casual observers and domain experts alike thanks to the produced image quality and temporal consistency. Beyond entertainment, this creates opportunities around safe data sharing of fully synthetic datasets, which are crucial in healthcare, as well as other domains relying on sensitive personal information. However, privacy concerns with this approach have not fully been addressed yet, and models trained on synthetic data for specific downstream tasks still perform worse than those trained on real data. This discrepancy may be partly due to the sampling space being a subspace of the training videos, effectively reducing the training data size for downstream models. Additionally, the reduced temporal consistency when generating long videos could be a contributing factor. In this paper, we first show that training privacy-preserving…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Image and Signal Denoising Methods
MethodsDiffusion · Focus
