A Semi-Synthetic Dataset Generation Framework for Causal Inference in Recommender Systems
Yan Lyu, Sunhao Dai, Peng Wu, Quanyu Dai, Yuhao Deng, Wenjie Hu,, Zhenhua Dong, Jun Xu, Shengyu Zhu, Xiao-Hua Zhou

TL;DR
This paper introduces a semi-synthetic data generation framework using causal graphical models to create datasets with underlying reasons for user ratings, enhancing causal inference and explanation in recommender systems.
Contribution
The authors propose a novel framework employing causal graphical models with missingness to generate semi-synthetic datasets for recommender systems, enabling causal analysis beyond traditional prediction tasks.
Findings
Constructed a semi-synthetic dataset (CTAR) with causal tags and ratings.
Demonstrated the framework's ability to generate explainable user-item interactions.
Provided baseline results and statistics for the CTAR dataset.
Abstract
Accurate recommendation and reliable explanation are two key issues for modern recommender systems. However, most recommendation benchmarks only concern the prediction of user-item ratings while omitting the underlying causes behind the ratings. For example, the widely-used Yahoo!R3 dataset contains little information on the causes of the user-movie ratings. A solution could be to conduct surveys and require the users to provide such information. In practice, the user surveys can hardly avoid compliance issues and sparse user responses, which greatly hinders the exploration of causality-based recommendation. To better support the studies of causal inference and further explanations in recommender systems, we propose a novel semi-synthetic data generation framework for recommender systems where causal graphical models with missingness are employed to describe the causal mechanism of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Advanced Graph Neural Networks · Explainable Artificial Intelligence (XAI)
