TL;DR
This paper introduces Adv-MultVAE, an adversarial training method that effectively removes protected user attribute information from recommendation models while maintaining high recommendation accuracy.
Contribution
It proposes a novel adversarial training approach integrated into MultVAE to mitigate biases related to protected attributes in recommendation systems.
Findings
Adv-MultVAE reduces the leakage of protected attribute information.
The method maintains recommendation performance with minimal loss.
Bias mitigation is effective on multiple datasets.
Abstract
Collaborative filtering algorithms capture underlying consumption patterns, including the ones specific to particular demographics or protected information of users, e.g. gender, race, and location. These encoded biases can influence the decision of a recommendation system (RS) towards further separation of the contents provided to various demographic subgroups, and raise privacy concerns regarding the disclosure of users' protected attributes. In this work, we investigate the possibility and challenges of removing specific protected information of users from the learned interaction representations of a RS algorithm, while maintaining its effectiveness. Specifically, we incorporate adversarial training into the state-of-the-art MultVAE architecture, resulting in a novel model, Adversarial Variational Auto-Encoder with Multinomial Likelihood (Adv-MultVAE), which aims at removing the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
