COFFEE: Counterfactual Fairness for Personalized Text Generation in Explainable Recommendation
Nan Wang, Qifan Wang, Yi-Chia Wang, Maziar Sanjabi, Jingzhou Liu,, Hamed Firooz, Hongning Wang, Shaoliang Nie

TL;DR
This paper introduces COFFEE, a framework for achieving counterfactual fairness in personalized explanation generation for recommendation systems, addressing biases inherited from language models and promoting fairer user treatment.
Contribution
The paper proposes a novel framework for measure-specific counterfactual fairness in personalized explanation generation, improving fairness in recommendation explanations.
Findings
Effective reduction of bias in explanation generation
Improved fairness metrics demonstrated through experiments
Positive human evaluation results
Abstract
As language models become increasingly integrated into our digital lives, Personalized Text Generation (PTG) has emerged as a pivotal component with a wide range of applications. However, the bias inherent in user written text, often used for PTG model training, can inadvertently associate different levels of linguistic quality with users' protected attributes. The model can inherit the bias and perpetuate inequality in generating text w.r.t. users' protected attributes, leading to unfair treatment when serving users. In this work, we investigate fairness of PTG in the context of personalized explanation generation for recommendations. We first discuss the biases in generated explanations and their fairness implications. To promote fairness, we introduce a general framework to achieve measure-specific counterfactual fairness in explanation generation. Extensive experiments and human…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Recommender Systems and Techniques · Machine Learning in Healthcare
