Reinforcement-Guided Synthetic Data Generation for Privacy-Sensitive Identity Recognition
Xuemei Jia, Jiawei Du, Hui Wei, Jun Chen, Joey Tianyi Zhou, Zheng Wang

TL;DR
This paper introduces a reinforcement-guided framework for generating synthetic data tailored to privacy-sensitive identity recognition, improving data quality and model performance under data scarcity.
Contribution
It presents a novel reinforcement learning approach that adapts pretrained generative models to specific privacy-sensitive tasks, optimizing for realism and diversity.
Findings
Enhanced generation fidelity and classification accuracy.
Effective domain adaptation with limited data.
Strong generalization to new categories in small-data settings.
Abstract
High-fidelity generative models are increasingly needed in privacy-sensitive scenarios, where access to data is severely restricted due to regulatory and copyright constraints. This scarcity hampers model development--ironically, in settings where generative models are most needed to compensate for the lack of data. This creates a self-reinforcing challenge: limited data leads to poor generative models, which in turn fail to mitigate data scarcity. To break this cycle, we propose a reinforcement-guided synthetic data generation framework that adapts general-domain generative priors to privacy-sensitive identity recognition tasks. We first perform a cold-start adaptation to align a pretrained generator with the target domain, establishing semantic relevance and initial fidelity. Building on this foundation, we introduce a multi-objective reward that jointly optimizes semantic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
