SIG: A Synthetic Identity Generation Pipeline for Generating Evaluation Datasets for Face Recognition
Kassi Nzalasse, Rishav Raj, Eli Laird, Corey Clark

TL;DR
The paper introduces SIG, a pipeline for generating ethical, balanced synthetic face datasets with controllable attributes, to improve face recognition evaluation without privacy concerns.
Contribution
We present SIG, a novel pipeline for creating high-quality, ethically sourced synthetic face datasets with controllable demographic features for evaluation purposes.
Findings
ControlFace10k dataset effectively evaluates face recognition models.
Synthetic dataset helps analyze demographic bias in algorithms.
SIG enables rapid, ethical dataset generation without privacy issues.
Abstract
As Artificial Intelligence applications expand, the evaluation of models faces heightened scrutiny. Ensuring public readiness requires evaluation datasets, which differ from training data by being disjoint and ethically sourced in compliance with privacy regulations. The performance and fairness of face recognition systems depend significantly on the quality and representativeness of these evaluation datasets. This data is sometimes scraped from the internet without user's consent, causing ethical concerns that can prohibit its use without proper releases. In rare cases, data is collected in a controlled environment with consent, however, this process is time-consuming, expensive, and logistically difficult to execute. This creates a barrier for those unable to conjure the immense resources required to gather ethically sourced evaluation datasets. To address these challenges, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis
MethodsNormalizing Flows · Sliced Iterative Generator
