Reinforcement-Guided Synthetic Data Generation for Privacy-Sensitive Identity Recognition

Xuemei Jia; Jiawei Du; Hui Wei; Jun Chen; Joey Tianyi Zhou; Zheng Wang

arXiv:2604.07884·cs.CV·April 10, 2026

Reinforcement-Guided Synthetic Data Generation for Privacy-Sensitive Identity Recognition

Xuemei Jia, Jiawei Du, Hui Wei, Jun Chen, Joey Tianyi Zhou, Zheng Wang

PDF

TL;DR

This paper introduces a reinforcement-guided framework for generating synthetic data tailored to privacy-sensitive identity recognition, improving data quality and model performance under data scarcity.

Contribution

It presents a novel reinforcement learning approach that adapts pretrained generative models to specific privacy-sensitive tasks, optimizing for realism and diversity.

Findings

01

Enhanced generation fidelity and classification accuracy.

02

Effective domain adaptation with limited data.

03

Strong generalization to new categories in small-data settings.

Abstract

High-fidelity generative models are increasingly needed in privacy-sensitive scenarios, where access to data is severely restricted due to regulatory and copyright constraints. This scarcity hampers model development--ironically, in settings where generative models are most needed to compensate for the lack of data. This creates a self-reinforcing challenge: limited data leads to poor generative models, which in turn fail to mitigate data scarcity. To break this cycle, we propose a reinforcement-guided synthetic data generation framework that adapts general-domain generative priors to privacy-sensitive identity recognition tasks. We first perform a cold-start adaptation to align a pretrained generator with the target domain, establishing semantic relevance and initial fidelity. Building on this foundation, we introduce a multi-objective reward that jointly optimizes semantic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.