Simultaneous Unlearning of Multiple Protected User Attributes From Variational Autoencoder Recommenders Using Adversarial Training
Gustavo Escobedo, Christian Ganh\"or, Stefan Brandl, Mirjam, Augstein, Markus Schedl

TL;DR
This paper introduces AdvXMultVAE, a novel adversarial training method that unlearns multiple user protected attributes simultaneously in variational autoencoder recommender systems, enhancing fairness and privacy.
Contribution
It proposes a new approach combining VAE and adversarial training to unlearn multiple protected attributes at once, improving fairness and privacy in recommender systems.
Findings
Outperforms single-attribute removal methods in fairness.
Effectively mitigates demographic biases.
Enhances anonymity of user embeddings.
Abstract
In widely used neural network-based collaborative filtering models, users' history logs are encoded into latent embeddings that represent the users' preferences. In this setting, the models are capable of mapping users' protected attributes (e.g., gender or ethnicity) from these user embeddings even without explicit access to them, resulting in models that may treat specific demographic user groups unfairly and raise privacy issues. While prior work has approached the removal of a single protected attribute of a user at a time, multiple attributes might come into play in real-world scenarios. In the work at hand, we present AdvXMultVAE which aims to unlearn multiple protected attributes (exemplified by gender and age) simultaneously to improve fairness across demographic user groups. For this purpose, we couple a variational autoencoder (VAE) architecture with adversarial training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Generative Adversarial Networks and Image Synthesis · Human Pose and Action Recognition
