A Multi-Strategy based Pre-Training Method for Cold-Start Recommendation
Bowen Hao, Hongzhi Yin, Jing Zhang, Cuiping Li, and Hong Chen

TL;DR
This paper introduces a multi-strategy pre-training approach combining GNNs and Transformers with contrastive learning to enhance cold-start recommendation performance, addressing limitations of existing models like PT-GNN.
Contribution
The proposed MPT method extends PT-GNN by integrating a Transformer encoder and contrastive learning, improving the capture of long-range and inter-correlations for cold-start scenarios.
Findings
MPT outperforms vanilla GNN models on three datasets.
MPT surpasses pre-trained GNNs on embedding inference tasks.
MPT demonstrates superior recommendation accuracy.
Abstract
Cold-start problem is a fundamental challenge for recommendation tasks. The recent self-supervised learning (SSL) on Graph Neural Networks (GNNs) model, PT-GNN, pre-trains the GNN model to reconstruct the cold-start embeddings and has shown great potential for cold-start recommendation. However, due to the over-smoothing problem, PT-GNN can only capture up to 3-order relation, which can not provide much useful auxiliary information to depict the target cold-start user or item. Besides, the embedding reconstruction task only considers the intra-correlations within the subgraph of users and items, while ignoring the inter-correlations across different subgraphs. To solve the above challenges, we propose a multi-strategy based pre-training method for cold-start recommendation (MPT), which extends PT-GNN from the perspective of model architecture and pretext tasks to improve the cold-start…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Advanced Graph Neural Networks · Topic Modeling
MethodsAttention Is All You Need · Linear Layer · Contrastive Learning · Absolute Position Encodings · Softmax · Residual Connection · Adam · Dropout · Position-Wise Feed-Forward Layer · Layer Normalization
