TIPRDC: Task-Independent Privacy-Respecting Data Crowdsourcing Framework for Deep Learning with Anonymized Intermediate Representations
Ang Li, Yixiao Duan, Huanrui Yang, Yiran Chen, Jianlei Yang

TL;DR
TIPRDC is a novel framework that enables privacy-preserving data crowdsourcing for deep learning by generating anonymized intermediate representations that hide private info while retaining useful data features for unknown tasks.
Contribution
It introduces a task-independent, privacy-respecting data sharing framework using anonymized features, addressing privacy issues in crowdsourced datasets for deep learning.
Findings
Effective privacy protection against attribute inference attacks.
Retains sufficient data information for unknown learning tasks.
Outperforms existing privacy-preserving methods in experiments.
Abstract
The success of deep learning partially benefits from the availability of various large-scale datasets. These datasets are often crowdsourced from individual users and contain private information like gender, age, etc. The emerging privacy concerns from users on data sharing hinder the generation or use of crowdsourcing datasets and lead to hunger of training data for new deep learning applications. One na\"{\i}ve solution is to pre-process the raw data to extract features at the user-side, and then only the extracted features will be sent to the data collector. Unfortunately, attackers can still exploit these extracted features to train an adversary classifier to infer private attributes. Some prior arts leveraged game theory to protect private attributes. However, these defenses are designed for known primary learning tasks, the extracted features work poorly for unknown learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
